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: Conference Services Office 
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Rt © Box Ji 
Santa Fe, WM 875¢i 
Dear Mr. Vasulkas 
Thank you very much for entering the SIGGRAPH "85 Ort Shew. There 
Were over GOW entries this year from argund the world. 
The following artwork has been acceoted by the SIGGRAPH 785 qurying a 


committee far display in the art show: 


The Co onmission, i984, Video 


In order for us to frame and mount the work in a timely fashion, we 

are requesting that you send your artwork toa our office by June 6. 
Please insure the work in transit, and let us know the insurance value 

af your work. Ship the pieces in either a pretected flat shipser or a 


mailing tube. He certain that the corners and edges are well oadcec. 
We will mount and frame the work and returned it framed and insured tr 
You. While we are taking great care in mounting and framing the work, 
please be advised that due ta our budget limitations we carmot provide 


a true museum quality or archival mounting. 


Artists working in large-scale format or previding instal 
or videotapes for the show will receive instructions for shipo3 


ac) of their work by ohone. # 


We le work forward to seeing you at SIGGRAPH '85, and thank vou fer your 
contribution to what we believe iS A remarkvable exhibition. 
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April 15,1982 


oN : ve | 
Dear Shennan X Wowdy 
Thank you for submitting your Gideoy fila work toa the SIGGRAPH’ 2 
Art Show. The response for the call for art was averwhelming. 
We would like ta include the following video/film piecels) in 


the exhibition: 


vdeo Frye 


Works will be compiled an 2/4" video tape for continuous showince 
fram submitted tapes. If you have a better copy of your piece, 
Dlease contact us so that we can make arrangements to get it. 
®l1l work will be insured by SIGGRAPH’ SS. Original material will 
be returned upon completian of the SIGGRAPH’ SS Art Show edit. 
The SIGGRAPH’ S2 Art Show edit will be destroyed after the confer 
ence. 


Flease send us slides of stills from the accepted work for poss- 
ible reproduction in the SIGGRAFH’S2 Art Show catalog and/ar 
coaster. Send slides (anly) ta: 


Louise Etra/SIGGRAFH’ 82 Art Show 
c/o GEST 

1440 San FPabla Avenue 

Rerkeley, CA 94702 

Tel: 415/527-7700 


Send videotapes and/ar films ta: 


fooper Gileath 

Real Time Design, Inc. 
S21 S. Flymouth Ct., #102 
Chicago, IL 40608 

Tel. 212/662 /0384 


Acceptance of your work entitles you far ane of the other 
artists if any? to a camplimentary Siggraph’ 82 registratian 
worth $175. You will receive informatian under separate cover. 


Yours truly 2 a 
(7 lott 


Capp Giloth 
SIGGRAPH" S2 Art Shaw Committee Chair 
S12 / S63 /0584 
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Dear Vash Siggraph Video Review Contributor: 


Mermep—thanks—fes the-meterteals you left me at—SieETapn’ oo. 
I just want to be sure you understand the nature of the 
Siggraph Video Review (SVR) and I have a few other avenues 
of distribution to ask you about. 


First, the SVR has been sent all over the world. Nearly 1000 
tapes have been distributed at cost. As the enclosed flyer 
indicates, four hours of material have been available for a year 
now. The primary customers have been schools, libraries, and 
computer graphics companies. No abuses of the material have 
occurred to my knowledge, as yet. With your help, I plan to 
have two more hours ready soon and another two hours about 
January. 


If you choose not to have your work considered public domain, 
please include a proper copyright notice on the enclosed form 

if your film/tape does not have one already. (Example: 

Copyright 1982 J.Q.Graphix.) Besides the legalistic implications, 
a copyright notice means people must get your explicit permission 
to make further copies, edits, ete. of your work from the SVR 
copies. If, on the other hand, you wish not to be bothered with 
requests, you may either skip the copyright notice or qualify it 
with a statement like "permission granted in advance for use 

by newsmedia and educational institutions," for example. 

I will also include a character~generated, paragraph of text of 
your choice preceeding your entry on the “SVR af you so indicate 
in the space labeled "Statement." In addition, I will include 


_ your panes address and/or ,PRone number uf you desire. 


w* 
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We have also budgeted enough money to pate auee a one-hour 16mm 


film for use by libraries, science museums and international 
conferences since they do not normally have access to the wonderful 
video projection gear we put together for the Siggraph annual 
conferences. Entries on 16mm film will be cut into a master; 

video entries will be transferred to film at Image Transform to 
assure quality. The film will be available on a loan-only basis 
until further notice. Any suggestions you might have to better 
this process are very welcome. 


People at WGBH in Boston and MIT have expressed interest in 
producing a videodisk of computer graphics with slides, films 
and videotapes on it. Digital data will also be encoded for 
various experiments in random access, branching, etc. Please 
indicate your interest in this project on the form, if any. 


(over ) 





: 


APPLIED Ee 
ENTEITAINVENT 
TECHNOLOGIES 


38855 Lankershim Boulevard Universal City, California 91604 213 VOE>+38OH 
444-3895 


December 8, 1982 


Messrs. Steine, Vasulka & Smith 
Route 6, Box 100 
Santa Fe, New Mexico 87501 


Dear Messrs. Steine, Vasulka & Smith: 


On behalf of the L. A. Chapter Siggraph I would like to thank you for 
your contribution to Siggraph West, held at the Art Center College of 
Design in Pasadena on October 30-31, 1982. Enclosed, for your informa- 
tion, is a brochure of the program. 


The event was more successful than we imagined (or planned for) but it 
would not have been so if it weren't for the videotapes and films that 
were submitted for the show. 


It is yours and all your co-workers' work that shined, and rightfully 
so. The time and energy spent in producing computer-generated imagery 
can sometimes go unrewarded: we hope that this event served as a token 
of appreciation and recognition for all those participating. 


I am proud to say that the tapes and films submitted represented the 
most comprehensive and current accomplishments in computer graphics 
(analogue and digital) in the world! And, because of the registrant 
response, we feel that we can offer, next year, a continuation of this 
historic evolution to a larger audience with a greater professional 
support force. 


The '82 Siggraph West was a West Coast inaugural event dedicated to the 
acknowledgement of achievement (and patience) in realizing "electronic 
perception". 

Believe you me, the show "knocked their socks off". 

Thank you so much. 

Sincerely, | 

Edward R. Arroyo 


ERA:11 
Encls. 
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the computer animation news! 


Volume 2, Number <4 
July, 198e5 


INDUSTRY MAKEUP CHANGING RAPIDLY - 
ARE YOU READY? 


The computer animation industry is changing rapidly - 
and dramatically. 


A quick walk around the Trade Show Exhibit Hall at 
SIGGRAPH 785, with its growing number af software and 
systems exhibitors, gives a good indication of what is 
happening (see our SIGGRAFH '85 analysis on page 5). 
As outlined in cur Arrnual Market Report, the camputer 
animation industry is changing. Many new players are 
antering the field and the number af camputer 
anifiaticn users is grawing daily. 


It is significant that these new entrants into the 
computer animation opreaduction arena are purchasing 
packaged systems. These are either complete “turn-key” 
systems of combined hardware and software, or ane of 
the growing number caf saftware packages now available 
which they can run on mainframes they already have or 
am cone of the new "super-mini" types of computers. And 
many of them are not in the film/videa preduction 
business, yet their uses af computer animaticon fall 
within our definition of this category that excludes 
CAD/CAM and Business Graphics. 


Very few new computer animation producers are willing 
ta invest the time and money required ta develop their 
awn software. To a large degree this is because there 
new 15 sa much excellent saftware available. The alder 
campanies didn’t have the ootian. Mast of it wasn’t 
ever available two years aga, ard what was om the 
Market did nat usually preduce an imace af sufficient 
Quality for TV broadcast work. They had ta develap 
their awn, 


What is alsa changing is the use these new systems are 
being put ta, We use the term "comouter animation" ta 
differentiate our subject from “computer graphics", 
which we use tc refer ta a single frame ar computer 
qenerated image, whereas “computer animation" is used 
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to describe a series af sequential images ar frames Strung tegether to give an 


The industry's. capabilities are alse Changing. We new are talking about mare 
than just the “animation”  oa¢ ifages that are the result of soneone’s 
Imagination of creative acticn. We now have ta Include the werd "Simulation" ir, 
cescribirnc our SuoTect matter - “Sing computer generated images ta "Simulate" 
Scenes, events ard objects from the real worid around us. And because the rea] 
world never stands stili, these Slmulated images are animated images, 


These simulations of real world Images form much af the computer animation 
cutputs that are starting to be used With sane regularity in the Fields of 
medicine, education, architecture res and entertainment. And at SIGGRAPH, more 
and more af the premoticonal works shown at vendor's booths were Of a 
"Simulation" nature thar a nov-realistic or Fantasy production. 


The implicatians af these changes wilj affect everycne involved in the computer 

aniimation/simulaticn area ~ oreducers, designers, Nardware/soaftware Supoliers 

anc users, Just ac taday's industry is very different frem that cof just twa 

years age, this chance will cont iriue, actually accelerate, over the next twa 

years. Farticularly as More  reorm-technical pecpie enter the industry as 
urchasers and users of Packaged “tlirn-key"! systems, 


The result ie that patential technalagical advantages will become less 
LiaDortarnt, Instead, the major Factors in cetermining SUCCESS in the Marketplace 
ae ae 
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be creativity and Market ing Ability. 


PACIFIC DATA IMAGES 


Mention the name Pacific Data Images, and mast pecple who fallow computer 
avilmat ior: usally have visians af clean, sharp, brightly colored computer 
animated on-air SEQUENCES produced for Major Americar, broadcasters, 


Rithoucn ADI 4ec peer Producing ccmouter animation for the better Dart of three 
YEars defore their cutstarnding visuale Far "Enitertainment Teri ht" first hit 
t@ airwaves, that series is what Mary pecDie first remember af their work, 





Started five Years aga prnext month by Carl Reserdani, the company began 
SPeratilans with Just three pessle - Rosendahl, Glen, Ertis and Richard Churg, 
Mey have written all their awn software, with Entis ana Churig Coing mast af 
tae early fourndatinn work. Much has since seen written ar, top af that, bath by 
these two, ard OY a niamber af ather people, This strong emphasis an continuing 
develoorent is te a large degree the result of Rosendahl’. policy of Fostering 
personal developmerit among is staff, a policy which alea PayS Dig dividends 
on a corporate level, 


Uperating with a relatively small, but very dedicated greauo af fifteer pecple, 
Roesencahi naves sure that everyeane has the eppertunity te persue their 
Particular ist Sts, using any computer free-tine tr preaduce animation ar 
Cevelop software, Much af theip werk Ends-uo incorporated in FDI’s client 


Orocuet ie =. 
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A considerable amount af Pacific Data Images’ work is seen cutside af North 
America. They have done several productions for TV Globo in Brazil, as well as 
Supplying their software for TV Gleba'’s new computer animation Facility, and 
melping them set it up. Ancther steady customer is Australia’s Channel 3. 


Aierican customers have ranged from videa games ta the major TY networks. About 
Raif af their design work is dane in-house, and they have a close collaboraticm 
with well known designer Harry Marks, whose name often appears an their 
credits, particularly om broadcast wor. 


FDI’s new facility in Sunnyvale, California, 4@ miles south of San Francisca, 
seems to reflect their corpcrate sersanality. It is a Bright, airy, starnd~alcne 
Ouilding situated in an industrial park surrounded by green lawns and shrubs. 
At the Suilding’s core is the comouter room housing their nine Ridge computers, 
ane IMi S80 computer far mation design ard seven Raster Technelagies frame 
nuffers. An interesting feature of tneir layout is three 'client roams’, each 
with a work station, oaffering facilities for client interaction with privacy 
and camfart, 


The building is completely '’wired’, which means they can route signals ta any 
workstation in the building fram any combination af computers. OQutput is via 
three video ceantrollers they built themselves, two for 3/4" video for test 
shots and one for final production onta a Sony BVU Fama i” Videutape recorder, 
Most of their work 15 oreduced onto video at Siz x Sl2 lines resolution, which 
Rosendahl believes adequately covers most of their requirements, hewever, they 
Can digitize images at up ta aaa x FAAA lines resolution and autpout anta film 
Via a Celcn recorder. 


They can alse input from video or tape and digitize these images far 
conbination with, or inclusien inta their animaticn. 


As a client, you won't find anything toc different fram regular industry 
preductlonm precedures at FDI. Fraductians fallow the normal steps af vectar 
mation tests, raster test frames, aoorndvals and final praducticons, and on 
average are completed in 6&6 ta 8 weeks. What we da think you will find different 
is the way they interact with you. They are a friendly, enthusiastic and very 
competent group, and with only 15 people in the company, you will get a Feeling 
oF real involvement. 


Absacugn they have sold a their software ta TV Gleabo and have enough saftware 
ta put together a comprehensive package, this is nat an area FDI is actively 
pursuing. They dean’t have the vecple available tea package or market their 
system, cr provide fallaw-up suppert, and they dan’t want to detract from their 
Main Business of computer animation oreductian. However, they orabably wouldn’t 
turn down any saies that walked threaugh the dacr. 


Pacific Pata Images have alse ret been tac active in the areas of advertising 
anc entertainment praducticrn. But this they doa want to change, being 
particularly interested in doing more work for advertisers and their agencies. 
They will run up against quite a few camoetitors in vying far the advertiser’s 
daliars, same of them sizable arganizations with sales staffs almast as large 
as PDI’s total company, so it won’t be easy. 
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But then Facific Data _ images has exceilent credentials which are exhibited in 
their demo tapes, and a long list of prestigecus awards which always appeals ta 
advertising agencies. Sa we think we will sacn be seeing some of their 


excellent computer animation used by major advertisers and more television. 


commercials with PDI’s name listed an their credits. 


Cacific Data Images 
flii Karlstad Drive, Sunnyvale CA, USA 94789. (488) 745-6755 


* * * 


Carl Resendahl started Pacific Data Images because he figured it was the only 
way he cauld get inta the computer animation business. 


"Back when I started the company five years agca, there weren't that Many pecaple 
doing it and [ couidn’t really figure cut why anyane wauld hire me anyway. I 
mean, Io was a kid aut of Stanford with a Batchelor’s Degree in Electrical 
Engineering and some film making experience from my high school days down in 
Los Angeles. Sa TI figured the anly way I was going to get a job in this 
industry was to make my awn. " 


That wind of "gutsy" aopreacn is still very much evident at FDI today. With a 
staff af omiy fifteen, and a psroducticn load that some of their larger 
competitors yet aspire tc, Raserndanl maintains a policy of allowing his peaple 
to have a certain amount caf time between praduction jobs to wark on their own 
projects - animation, developing scaftware, any type of preaject or whatever eise 
they want. “We get a let of little @ ta S second experiments, but we try to 
encourage pecple ta da larger projects. Grd really encourage people if they 
start a larger job to finish it, allowing a certain amount of time far them ta 
get it together afterwards. " 


This almost academic apprmach is quite different from the policy at mast ather 
camputer animation facilities where enplcayees have cansiderably less 
Flexibility and opportunity toa pursue their awn works. It says volumes about 
how Rosendahl runs his company - and it does result in a continuing stream af 
bright ideas that the company can tap into, and some excellent test pieces ta 
shaw on their demo tape, 


Anather area where Reserndahl differs fram many af his cecolleagues is in his 
defence of praduction directly onte videatape for broadcast/video use. "I dent 
agree that you get a cleaner look on film. You get a cleaner look by gqaing 
Girect ta videc because there is nothing that you are going ta put aon film that 
is going to get transferred ta video that yau can’t out directiy onto tape. The 
only reason I can say that they would do that is ta pet the higher resalutian 
of film, But by the time you have transferred ta tane you have last that 
resalutian anyway. 


"If you are going direct ta video you have all your colors available te yolls 
Film has a certain color range that it can reproduce and it worn’t reproduce 
everything you have when you go to it. Nor will NTSC. On an RGB monitor you get 
a beautiful yellow but you will never be able ta get that om NTSC. Film ta tape 
you lose some more, because there are a lat af colors and cantrasts on film 
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mato you can’t get oon tace. So what you Finally end uo with by the time you've 
gare ta fiim and back ta tape is only the colors and contrasts that can be 
resraducec an 50th fiim anc tape. ‘ 


Ps for Aig future olans, Roserndani sees continued steady grewth. ‘We'll 
certainiy comtirnue what we are doing. We naver'+t been very visible in the 
aGvertising world and we're going tc start becoming mucn mare active there. 
reyand that we’li orabably age cut inta other areas. There’s a wide Varlety af 
industries aut there in addition to the ones we're into now. We are always 
talking to various individuals in stner businesses, | 


Yet ne aces a word of caution. "Meen in mind we’re just 15 pecpie here. We've 
gat to be xinc ar Susy." Not that Roserndahi, or his associates, are afraid af 
hard work. it's part af his phileasephy, which shows to regularly in their work. 
“You dc it oecause you get ta try something new and different. and really have 
A gacd time with 2t." 


SIGGRAPH 7285 


7 tony de Peltrie, Abel Image Researcn, Lucasfilm's Pixar, 3-D cemputer animatican 
C aria systems, saftware and more systems. Siggraph '85 at a glance. 


ft every graphics confererce, expasiticon and festival, the biggest crowds are 
wsuaily found at the computer arimation showings. Siggraph %°'85 was) nia 
exceotion, and the two Siggest attractions were the outstanding computer 
generavec Character animation Gisplayed in “Tony de FPeltrie", a 7 minute and 5a 
second Film oreaduced oy four young men at the Universite de Mentreal (see 
Following article), anc the 3-D stereo computer animation film produced for the 
ritacni “avilliorn at Expo ’85 in Tsukuba by Digital Fraductians., 

The Hitachi 3-D stereca fiim, viewed with the heip of special polarized glasses, 
gave an excelient feeling af being immersed in the scenes, with viewers ducking 
as images from the screen seemed to come right cut at them. A nan 3-D versian 
was alse shewn on the videcn monitors. 


Gtner mneoteworthy showirigs were Kicwin’ir: the Wind by Bill Reeves of Lucasfilm 
wich siniuiated a grassy field con a windy day, Chasing the Rainbow by nee 
Miyagazio arecuced on the Links 1 system at Osaka University and Tuber’s Two 
step oy Chris Wedee af Ohia State University. The commercial dema reels were, 
as always, visual and technical ma&sterpieces, with same of the mest interesting 
work shown resuiting fram variaus employee's own after-hours praductions. 
Pacific Data Images gathered all theirs onto a soecial reel separate from their 
cammercia. shaw reel, 





Gy the Exhibition Hall fleor the twa thotsoots’? that attracted the BIG crawds 
yy were Lucasfiln’s Fuxar sooth (see sur Feb/85 issue) and the Abel Imane Research 
C sooth. Tneir displays were well presented, but it was the glory attached ta 
their names that drew pecple ta them. However, the real thrust of the 
Exhibition, and they were very much a oart af it, was in the areas that confirm 
the direction the computer animation, graphics and simulation industry is 
heading - turn-key systems ard saftware packages, 
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The turn-key systems ara software packages displayed cavered a wide range of 
sapnistication and cost. At the law end ef the snectrum is the simple paint 
saftware that runs on your personal computer and is coperated thraugh keyboard, 
small digitizing tablet and/or ligntpen, usually requiring the additicn af a 
sraphics beard toa your PC that can cast from USS1,aue ta USS$5, aaa. 


Next are the ¢-D and 3-D systems based an an TBM PC or compatible as a central 
processar, with saftware and peripherals adced as part of the package. 
Cubiecap, ane of the leaders in this area, demenstrated a considerably upgraded 
system, with impraved resolution, capabilities and perioherals. Its base price 
is US#e9, 580. With add-ons you can turn it into a full graphics ar animation 
orocucticn system. Their saftware was actually used in the production af images 
and effects for the films “24@1a" and “Starman”. 


For more money you start getting inta the mini-computer Based systems. Prices 
vary widely here, fram US$49,588 for the basic Images 1] system from Computer 
Graphics Lab to almast US$3aa, aaa for a fully canfigured Alias Research system. 
In between are systems such as thase frem Symbalics, Artranics, Nea-Visuals and 
Basch with their well established FGS-4aaa, 


Software is available from several companies, including Wavefront Techncalogies, 
Nea-Visuais, Antics and Computer Graphics Lab, Dut the big news at the shaw was 
the new affering from Abel Image Researcn. Their saftware is canfigured (and 
priced) ta run on a wide range af hardware, from a Single Iris workstation at 
US$8a, Aaa te a Cray at about USS$3dd, aaa. The first license for Abel’s saftware 
package was issued well before the announcement at Siggraph ta Electric Image 
im London, England. They started praducticon this oast May and have already 
curmned aut several jobs and some excellent work. 


Computer manufacturers were cut in force, all displaying software from varicus 
Suppilers parted ta their machines. The wide range of available camputers 
suitable for camputer arimation praducticm and the variety of software 
offerings ta run them, give praspective purchasers ccansiderable choice. 


Included ain the interesting French industry exhibit was the De Grafe computer 
which cames with its awn integral videc graphics software allcawing a wide range 
af applicaticans as supplied. Other computer animaticn systems from France were 
the Xeon Grapn 9 and Teimat's CURT 7. 


With all the systems, software and camouter canfiguraticons raw available, 
getting inte the computer animation business 1S beccaming much easier and less 
expensive every day. Ard this trerid is continuing. It is getting to the point 
where the equipment required ta output and record your images 1s carnsiderably 
more expensive than the equipment needed ta create these images. 


Praducivig camolex images quickly still recuires lats of expensive computer 
power, so the high-erd of the business will remain fairly exclusive. Hut the 
bDraac range af proacuction for video cautput, as indicated by the Trade 
Exhibition and ever some of the oraductions shawn, 1S becoming a very 
competitive market. For ooth equinoment/systems/software suppliers as well as 
orecducers, Naw you well you use your equipment ar market your preaduct, is 
becoming the key ta business success. 
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TONY DE PELTRIE 


Tne nmigniignt of the Siggraph '85 Film & Video Show was tne world premiere of 
tre smart animated film Tony de Feltrie. Entirely created by computer, it not 
only 1s a technical irene rere out umlike many otner deveicomental 
ConGuver animatian prajects, it is each ely eritertaining alsa, 


Pais is full tnree dimensional cnaracter arimation, camplete with changing 
Facial expressions, smooth body mavenernts and even Close-ups af the character's 
nangs anc fingers as he olays his piano. Rurmine for 7 minutes and 5a seconds, 
it iS fore thar a Brief systems/software test. It tells the story af an aging 
plano player in a planc-bar, whe although once the center af popularity nas maw 
beer left behind by times and cnarioing fashions. Surrounded by his memcaries, he 
Qlays his last performance. 


Tony de Feltrie is the creaticn of an indecscendent group maae up of Pierre 
Lachapelle, the originator of the project and its precucer and ca-directar, 
Peiliope Bergeren, co-director and cnaracter ariimator, Hierre Radidaux, ca- 
directer and special effects animator and Daniel Langlais, co-director and art 
director. initiated in July, 1982, finai preducticn required about four man 
years ta complete and cast over US$1.5 million. 





The film was produced using TAGRNG, a powerful 3-D computer grapnics system 
concezved for artists and cGesigners develcoed at the Centre de Caicui ce 
liumiversite ce Mertreal and orcgrammed with DADS software. Filming in 35mm was 
snot at Canada’s National Film Hoard, and financial Ssuopart came fram the 
Miristere des Affairs Culturelies du Guesec and the Canada Council. About 22 
peaple in tetal worked on cr assisted directiy in the project. 


Osviously mot a cast-effective orscuction, Tory G@ Peltrie is one af thease 
landmark efforts that dot the short history of Seam animation. Like Triple 
ift’s Agam the Juggier, and The Adventures of Andre ard wWaliy EK fran 
ciicasfilm's Comouter Division, sust ta name a eoupie, 1t shows what can be dane 
with tne medium ard gives us a target to aim for an cur more mundane everyday 
proqducticne, But trust our inventive French Canacians to make a dramatic and 
entertaining praduction out of it. 


Lachapeile, Bergeron, Rabidaux & Langlais 
L.733 Draseau, Montreal Nerd 6G, Canada Hik 3K7. ($14)737-48a0 or (S14) 225-524 


ELECTRIC IMAGE 


Cur apologies ta Fanl Decherty arc Electric Image in Lornoon. Contrary ta wnat 
We wrote in aur May/85 issue, con are nev a Givision of Yisisons. Although they 
Snare premises, Visions nas me financial or administrative interest wm Electric 
Image, or vice-versa. 


Electric Image is tne first iicernsees of the new Ode} image Researcn software 
anmaunceGc at Sigeraph *85, anc was just getting their system up and running 
wher visited last May. They have been very ousy Since, and we'll have same 
eamdies af their exceiiernt work to show our readers soc, 
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INTERACTIVE MACHINES INC 





One of the less imposing computer manufacturer's booths at the Siggrapn 785 
Trade Forum was that af Interactive Machines Inc. Nea fancy furniture oar. 
eglorful grapnics, just an IMT Sa@ and an IM] 455, both up and running, and 
always several interested people. 


Although their computers may have wicer distrioution in the design, engineering 
and aeronautical industries, it seems tnat just adcut any sericus computer 
animation facility has at least one IMD Bad. Sametimes tnat’s all they have 
although they are generally used in canjyunction with VAXs, Ridges, Pyramids or 
ever. Crays. One reason is that the IML S@@ is designed for a graphics 
applicaticar. 


Anether is that IMI Fresidernt Ker Dozier, with a business background ranging 
from flight simulaters and super computers ta computer graphics preducticn, has 
a goad grasp of what’s happening and where the dSusiness 158 going. When 
questianed, he 1s quite emphatic about it, "In an emerging marketplace the 
ability to inmevate is the ability ta survive. Sea that’s wnat everyone is 
lecking for, the ability tc be able ta inravate and therefcre they will thrive 
and survive in this emerging marketplace. And that’s really very important." 


Having worked with many computers, including the Cray, he believes that 
"flexibility" im equioment choice is equally important, “It's the flexibility 
that you're looking for. I think that anybody that paints themself ainta a 
hardware pasitian right now is really going ta feel it." 


Dozier believes that the IMI S@2@ is right in step with the new industry 
emphasis on Simulation. "“Freduction graphics is hierarcnical by nature. The 
portion that we fill is the interactive portion. We are a hign resolution, real 
time computer simulation device. Ard what we have is an internal graphics 
processor that enables us ta do an extensive amount af mathematics in real 
time," He is alse a big premeter of the grawing use af computers for scene pre- 
production, “We can shaw up ta 1@@ objects moving dynamically in real time, 
which plays right inte the capability of doing scene planning. Entertainment is 
constantly putting their emphasis on pre-production - especially in tne area of 
special effects. They want ta minimize risks. " 

As for IMI’s future sraspects, he believes that the inherent flexindility of tne 
IMI S@@ will cenmtinue ta stimulate their sales. “The community out there 1s 
insatiable, and once it sees it, it tires of it and wants something rew. The 
days «af the old hast computer with the program running on it and the dump 
terminal are dead. Now the local imtelligence design/simulaticon station has to 
have intelligence of its own. Its cwr disks and its oWn operating system. And 
it has tc be very, very, very interactive. And then it will be straoped onto a 
host tc pravide secandary number crunching abilities for it." 


We think Ken Dozieris comments on what’s happening im cur industry are right om 
target, and spotlight some af the industry changes reflected at Siggraph. 


Interactive Machines Inc | 
733 Lakefield Road, Westlake Village CA, USA 91361-2694. (818)707-18828 
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BOSCH FGS-42000 UPGRADES 


With lots cf new venders appearing in the turn-key systems market, Robert Basch 


is working hard ta keep their FGS-48a4 computer graphics system competitive arid 
maintain their market lead. 


Two new scftware packages affered are their Light Source Editar, which carn 


—_— — —— ee ee ore ee oe —_— oe ee et ee 


contral and coanfigure up ta 16 independent light sources, and their Terrain 


Medeler for generating random land sources with fractal-like surfaces. They 


have also added twa major enhancements ta their existing saftware, a ‘qgouraud’ 
shading model anc a ‘pharg’ shading model, and are cantinuing develcpment af a 


—a mite = — ee ee me oot oo oe — > ate ate mie 


To increase sales intca industrial markets and other areas tnat have neat 
traditionally used their videc/broadcast products but are potentials for the 
FGS-4002, ard are alsa being pursued by their coampetitars, Bosch has 
repositioned its national sales force into twa separate divisions, with cane 
dedicated ta these new market areas. 


The Bosch dema tape shown at Siggraph °85 is an eight-minute compilation af 
work produced on the FGS-4@8@ by users arcund the world. 


Rabert Rosch Corporation - Video Equipment Divisicn 
FQ Bax 31816, Salt Lake City UT, USA 84131. (88@1)972-e8aN0 


UPCOMING EVENTS 


Octaber 4-95: 

F.I.D.AJA.M. - First International Festival af Cartoon Animation and Puppetry, 
Juan-Les-Fins, France. Computer animation included. 

e838 Rue Gioffredoa, MEA Nice, France. (93)62 Oe Fa 


Octceber 16-18: 

Computer Graphics ' 85, 

Wenbly Conference Ceriter, London, England. 

Crline Conferences, FPirner Green House, Ash Hill Drive, Finner, Middlesex, 
HAS @AE, Enoland. (@1)868-4466 


Neovenber 3-1: ‘ 
Cambridge Animation Festival '85, 
PQ Box 33e, Loandan, England N6 SYH. (@1)341-S5a15 


Navember 8: 

One Day Seminar in Computer Aided Art & Design (CAAD) 

Faculty af Art & Design, 

Middlesex Polytechnic, Cat Hill, Barnet, Herts EN4 BHT, Engiand (@1)44@-5181 


There are many events of potential interest ta cur readers accuring regularly 
all over the world. Keeping track of them is almast a full time job and to list 
them within these pages 1s impossible. Insteaq we are compiling an on-going 
list that we will regularly update and sublisn periedically fer our readers. 
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ROBERT ABEL & ASSOCIATES. 


"Erilliance", Sexy Robat 32 second soot! 


for Carnned Foad Information Couneil 1s 
Abel shtwcase for new satftware 
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>. TONY DE PELTRIE. 
| 


OProduced by Fierre iLachapelle, Prhilaipnpel 


| Bergeron, Fierre Robidoux and Daniel ' 
j Langlois at Universite de Montreal. | 





4. MAGI SYNTHAVISION. 1 
| Experimental work shawing transparency 
| from their new snawreel. i 





5, CHRIS WEDGE. 
/ “Tuper's Twa Steno", 3-D non geometric / 
| eharacters in well cnoresgrapned set. t 
| teaduced at Ghic State University. ! 
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1, PACIFIC SATA IMAGES, 
"Naocy Orinking sires", develoomenta: 





| animation sy Richard Comen, who trainesi 


| at Sherigan Coliece in Toronto. 
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3. PACIFIC DATA IMAGES. 
| "Toast on a Lounge Chair’, | 
; Gevelopmental animation by Roger Gouic. 3 
| work 1S great when you are having fun. : 











Se OMNIELS COMPUTER GAAPHICS, 
| beS's Tali 'a5 an-ay- apart ie Desian 


i pv Jonn retravest. fort Fallack W for ; 
- C23 Cntertairment Division. 
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ce PRCT IC ~ATA IMAGES. 
oe ce’, cevelopmental animation 

¥ chaddeus Berer. Part of FDI's after-1 
Sire sersonal work program. 
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4, P&CIFIC DATA IMAGES. 7 

i NBC's fall '85 on-air promos continue 
| tneir “Be There” camoaign. Desian by 
"Harry Marks. Jonn Miller for NBC. 
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6, OMNIBUS COMPUTER GRAPHICS, 
' Soec:al promo oy Shell Brazil for majori 
| music festival, produced for MoviArt, | 
i 2o-ord Sy Ziliion, design py O&M Publ. | 
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Papers from Siggraph 76: The Third Annual 
Conference on Computer Graphics, Interactive 
Techniques, and Image Processing 


Sponsored by ACM Special Interest Group on Computer Graphics 


July 14-16, 1976 


The organizers of the first annual conference on 
computer graphics and interactive techniques believed 
that a joint SIGGRAPH-SIGPLAN effort to identify, con- 
solidate, and study computer graphics principles 
would lead to increased, productive understanding of 
many topics. The success of the first annual conference, 
held in Boulder in July 1974, followed by an equally 
successful second annual conference, held in Bowling 
Green in June 1975, gave rise to plans for the third 
annual conference held in Philadelphia. The topics 
studied in the papers for the third annual conference 
covered a rich spectrum of current research interest, 
including: programming languages and data structures; 
hardware; device independent graphics, animation; 
satellite graphics; cartography; graphics input/output 
devices; system software; application areas such as 
medical/biomedical, civil engineering and architecture; 
Surface representations, CAD: large systems and 
social graphics. 

The program committee for the third annual con- 
ference received 143 abstracts, 118 papers, representing 
almost all areas of current research, and selected 58 
papers for presentation. From this program of papers, 
four have been chosen for publication in this issue.* 
All papers were reviewed by referees in the usual 
fashion and the four appearing here are each revised 
or extended revisions of the original papers, with only 
an abstract of these four papers appearing in the 
Conference Proceedings. 

The paper by Blinn and Newell, an excellent ex- 
tension to the Catmull algorithm, presents develop- 
mefits in the areas of texture simulation and lighting 
models, with some beautiful generated images resulting 
from the reported work. In addition, the authors present 
Simulation of mirror reflections on curved surfaces, 
incorporating reflected light intensity as well as the 
intensity obtained from the texture mapping. 

Two of the papers study algorithms for displaying 
surfaces or advanced hidden surface techniques. The 


— 





* The remaining papers appear in Computer Graphics (ACM 
SIGGRAPH newsletter), Vol. 10, No. 2 (Summer 1976). This 
special issue is available prepaid from: ACM, P.O. Box 12105, 
_ Church Street, New York, NY 10249; ACM and/or SIGGRAPH 
members $10.00, nonmembers $20.00. 
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paper by Clark deals with the design of systems for 
efficiently producing computer generated pictures and 
picture sequences of very complex three-dimensional 
environments. An important aspect of the proposed 
hierarchical model is that it provides a way to vary de- 
tail, yielding an incremental improvement to existing 
Systems for producing computer pictures without 
modifying their visible surface algorithms. An addi- 
tional improvement is suggested by a totally new re- 
cursive descent visible surface algorithm in which the 
computation time potentially grows linearly with the 
visible complexity of a scene rather than as a worse 
than linear function of the object-space complexity. 

The paper by Levin considers an algorithm for 
drawing pictures of three-dimensional objects, with 
surface made up of patches of quadric surfaces. Meth- 
ods of parameterization for each type of quadric sur- 
face intersection curve are discussed, as well as surface 
bounding and hidden surface removal. The principal 
contributions of this algorithm are that it uses all real 
quadric surfaces, allowing smooth intersections of ap- 
proximate higher-order surfaces, and that it may be 
fast enough to be used in conjunction with a shading 
algorithm. 

The problem of motion dynamics in animation is 
made challenging by the fact that the dynamics de- 
scription must not only describe the behavior of the 
object but also all motion subcomponents relative to 
each other. The paper by Burtnyk and Wein develops a 
Significant increase in the capability for controlling 
motion dynamics in key frame animation through 
Skeleton control. This technique allows an animator to 
develop a complex motion sequence by animating a 
stick figure representation of an image, and then driv- 
ing this image sequence through the same movement. 

The members of the executive committee for the 
third annual conference were T. Johnson (General 
Chairman), U.W. Pooch (Technical Program Chairman 
and Proceedings Editor), Norman Sondak (Publicity / 
Publications Chairman), and Katherine Moses (Local 
Arrangements Chairman). 


U.W. PoocH 
Special Issue Editor 


Communications October 1976 
of Volume 19 
the ACM Number 10 





Graphics and 
Image Processing 


Texture and Reflection 
in Computer 
Generated Images 


James F. Blinn and Martin E. Newell 
University of Utah 


In 1974 Catmull developed a new algorithm for 
rendering images of bivariate surface patches. This 
paper describes extensions of this algorithm in the 
areas of texture simulation and lighting models. The 
parametrization of a patch defines a coordinate system 
which is used as a key for mapping patterns onto the 
surface. The intensity of the pattern at each picture 
element is computed as a wei ghted average of regions 
of the pattern definition function. The shape and size 
of this weighting function are chosen using digital 
signal processing theory. The patch rendering algorithm 
allows accurate computation of the surface normal to 
the patch at each picture element, permitting the simula- 
tion of mirror reflections. The amount of light coming 
from a given direction is modeled in a similar manner 
to the texture mapping and then added to the intensity 
obtained from the texture mapping. Several examples 
of images synthesized using these new techniques 
are included. 

Key Words and Phrases: computer graphics, graphic 
display, shading, hidden surface removal 

CR Categories: 3.41, 5.12, 5.15, 8.2 


Copyright © 1976, Association for Computing Machinery, Inc. 
General permission to republish, but not for profit, all or part 
of this material is granted provided that ACM’s copyright notice 
is given and that reference is made to the publication, to its date 
of issue, and to the fact that reprinting privileges were granted 
by permission of the Association for Computing Machinery. 

A version of this paper was presented at SIGGRAPH ’76: 
The Third Annual Conference on Computer Graphics, Interactive 
Techniques, and Image Processing, The Wharton School, Uni- 
versity of Pennsylvania, July 14-16, 1976. 

This work was supported in part by ARPA under Contract 
DAH15-73-C-0363. Author's address: Computer Science Depart- 
ment, University of Utah, Salt Lake City, UT 84112. 
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Introduction 


In 1974 Edwin Catmull [2] developed an algorithm 
for rendering continuous tone images of objects modeled 
with bivariate parametric surface patches. Unlike most 
earlier algorithms [6, 8, 9, 10], which require that ob- 
jects be approximated by collections of planar poly- 
gons, Catmull’s algorithm works directly from the 
mathematical definition of the surface patches. The 
algorithm functions by recursively subdividing each 
patch into smaller patches until the image of each 
fragment covers only one picture element. At this 
Stage, visibility and intensity calculations are performed 
for that picture element. Since the subdivision process 
will generate picture elements in a somewhat scattered 
fashion, the image must be built in a memory called a 
depth buffer or Z-buffer. This is a large, random access 
memory which, for each picture element, stores the in- 
tensity of the image and the depth of the surface visible 
at that element. As each patch fragment is generated, 
its depth is compared with that of the fragment cur- 
rently occupying the relevant picture element. If greater, 
the new fragment is ignored, otherwise the picture 
element is updated. 

This paper describes extensions of Catmull’s al- 
gorithm in the areas of texture and reflection. The 
developments make use of digital signal processing 
theory and curved surface mathematics to improve 
image quality. 


Texture Mapping 


Catmull recognized the capability of his algorithm 
for simulating variously textured surfaces. Since the 
bivariate patch used is a mapping of the unit square in 
the parameter space, the coordinates of the Square 
can be used as a curvilinear coordinate system for the 
patch. It is a simple matter for the subdivision process 
to keep track of the parameter limits of each patch 
fragment, thereby yielding the parameter values at 
each picture element. These parameter values may then 
be used as a key for mapping patterns onto the sur- 
face. As each picture element is generated, the para- 
metric values of the patch within that picture element 
are used as input to a pattern definition function. The 
value of this function then scales the intensity of that 
picture element. By suitably defining the pattern func- 
tion, various surface textures can be simulated. 

As Catmull pointed out, simply sampling the tex- 
ture pattern at the center of each picture element is 
not sufficient to generate the desired picture, since two 
adjacent picture elements in the image can correspond 
to two widely separated points in the patch parameter 
space, and hence to widely separated locations in the. 
texture pattern. Intermediate regions, which should 
somehow influence the intensity pattern, would be 
skipped over entirely. This is a special case of a phe- 
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nomenon known as “aliasing” in the theory of digital 
signal processing. This theory [7] treats the image as a 
continuous signal which is sampled at intervals cor- 
responding to the distance between picture elements. 
The well-known ‘‘sampling theorem”’ states that the 
sampled picture cannot represent spatial frequencies 
greater than | cycle/2 picture elements. ‘Aliasing” 
refers to the result of sampling 4 signal containing fre- 
quencies higher than this limit. The high spatial fre- 
quencies (as occur in fine detail or sharp edges) re- 
appear under the alias of low spatial frequencies. This 
problem is most familiar as staircase edges or ‘‘jag- 
gies.”’ In the process of texture mapping, the aliasing 
can be extreme, owing to the potentially low sampling 
rate across the texture pattern. 

To alleviate this problem we must filter out the high 

Spatial frequency components of the image (in this 
case the texture pattern) before sampling. This filter- 
ing has the effect of applying a controlled blur to the 
pattern. This can be implemented by taking a weighted 
average of values in the pattern immediately surround-. 
ing the sampled point. Digital image processing theory 
provides a quantitative measure of the effectiveness of 
such weighting functions in terms of how well they 
attenuate high frequencies and leave low frequencies 
intact. . 
_ Catmull achieved the effect of filtering by main- 
taining an additional floating-point word for each 
picture element. This word contained the fraction of 
the picture element covered by patch fragments. For 
each new fragment added to the picture element, the 
texture pattern was sampled and the intensity was 
averaged proportionally to the amount of the picture 
element covered by the patch fragment. Examination 
of the spatial frequency filter effectively implemented 
by this technique shows that it is much better than 
point sampling but is not optimal. 

The method discussed here does not require the 
extra storage and uses a better anti-aliasing filter. 
This filter is implemented by a weighting function origi- 
nally used by Crow [3] to minimize aliasing at polygon 
edges (‘‘jaggies’’). It takes the form of a square pyra- 
mid with a base width of 2X2 picture elements. In 


Fig. 1. Region of texture pattern corresponding to picture element: 
left-hand side shows texture; right-hand side shows image. 
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the texture mapping case, the 22 region surrounding 
the given picture element is inverse mapped to the cor- 
responding quadrilateral in the u,v parameter space 
(which is the same as the texture pattern space), see 
Figure 1. The values in the texture pattern within the 
quadrilateral are weighted by a pyramid distorted to 
fit the quadrilateral and summed. 

The derivation of the quadrilateral on the texture 
pattern makes use of an approximation that the para- 
metric lines within one picture element are linear and 
equally spaced. The X,Y position within a picture 
element can then be related to the u,v parameters on 
the patch by a simple affine transformation. This trans- 
formation is constructed from the u,v and X,Y vatues 
which are known exactly at the four corners of the 
patch fragment. 

Given this algorithm, we now investigate the effects 
of various texture definition methods. We will use, aS 
our sample object, a plain teapot constructed of 26 
bicubic patches. First, the pattern may be some simple 
function of the u,v parametric values. A useful ex- 
ample of this is a simple gridwork of lines. The result is 
as though parametric lines of the component patches 
are painted on the surface, Figure 2. Note that the 
edges of the pattern lines show very little evidence of 
aliasing in the form of staircases. Second, the pattern 
may come from a digitized hand drawn picture, Figure 
3. Third, the pattern may come from a scanned-in 
photograph of a real scene, as in Figure 4. Incidentally, 
this picture makes the individual patches very clear. 
This type of pattern definition enables the computer 
production of ‘anamorphic” pictures. These are pic- 
tures which are distorted in such a way that when viewed 
in a curved mirror the original picture is regenerated, 
Figure 5. The patch itself is defined so that the para- 
metric lines are stretched in approximately the correct 
fashion and a real photograph is mapped onto the 
patch. Figure 5 should be viewed in a cylindrical mirror 
(e.g. a metal pen cap) with the axis perpendicular to 
the page. The fourth source of texture patterns shown 
here is Fourier synthesis. A two-dimensional frequency 
Spectrum is specified and the inverse Fourier transform 
generates the texture pattern. This is a simple way of | 
generating wavy or bumpy patterns. Certain restrictions 
on the form of the input spectrum must be followed to 
ensure that the pattern has an even distribution of 
intensities and is continuous across the boundaries. 
An example of this type of texture is shown in Figure 
6. The texture patterns used here were generated before 
picture synthesis began and stored as 256256 
element pictures ‘in an array in random access 
memory. 


Reflection in Curved Surfaces 


The second topic discussed in this paper concerns 
lighting models. Typically, visible surface algorithms 
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idwork texture pattern: left-hand side shows texture pattern; right-hand side shows textured object. 


Fig. 2. Simple gr 





Fig. 3. Hand sketched texture pattern: left-hand side shows texture pattern; right-hand side shows textured object. 





have been used to give the impression of shiny surfaces, 
but there is little physical justification for such func- 


determine intensities within an image by using Lam- 
bert’s (cosine) law: i = s(L-N), where i = intensity, 


S = surface shade, L = light direction vector, N = tions. and the range of effects is limited. The modeling 
Surface normal vector, and ‘-’”’ denotes vector inner of more realistic lighting was first investigated by Bui- 
product. Tuong Phong [1]. His model of reflection incorporated. 

Variants on this function, such as a term which produced a highlight over portions of the 


i= S(L-N)een, tor n> | Surface where the normal falls midway between the light 
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Source direction and the viewing direction. This is moti- 
vated by the fact that real surfaces tend to reflect more 
light in a direction which forms equal angles of inci- 
dence and reflectance with the surface normal. This can 
be easily implemented by simulating a virtual light 
source in a direction halfway between the light source 
and viewing directions, and raising the result to some 
high power to make the highlights more distinct: 


i= s(L-N) + g(L’-N)««n 


where L’ = virtual light source direction, g = glossiness 
of surface (0 to 1). Figure 7 shows an image generated 
using the above function with n = 60. These techniques 
work well for satin type surfaces but images of highly 
polished surfaces still lack realism. This is largely due 
to the absence of true reflections of surrounding objects 
and distributed light sources. 

The simulation of reflections in curved surfaces 
requires an accurate model of the properties of the 
surface and access to accurate normal vectors at all 
points on the surface. The approximation of curved 
surfaces by collections of planar polygons is inade- 
quate for this purpose, so extensions of the techniques 
of Gouraud [5] and Bui-Tuong Phong [1] hold little 
promise. 

The subdivision algorithm, however, provides ac- 
curate information about surface position and can 
be made to give accurate surface normals at every 
picture element. This is the first algorithm that provides 
the appropriate information for the simulation of mirror 
reflections from curved surfaces. For each picture ele- 
ment, the vector from the object to the observer and 
the normal vector to the surface are combined to de- 
termine what part of the environment is reflected in 
that surface neighborhood. It can be shown that, for 
surface normal vector (Xn, Yn, Zn) and viewing posi- 
tion (1, 0, 0), the direction reflected, (Xr, Yr, Zr), is 


Xr = 2#XneZn, Yr = 2*Yn*Zn, Zr = 2*Zn«Zn — I, 


Having established the direction of the ray which is 
reflected to the eye, it remains to find what part of the 
environment generated that ray. For this, a model of 
the environment is needed which represents surround- 
ing objects and light sources. Clearly, the view of the 
environment as seen from different points on the sur- 
face will vary. However, if it is assumed that the en- 
vironment is composed of objects and light sources 
which are greatly distant from the object being drawn, 
and that occlusions of the environment by parts of the 
object itself are ignored, then the environment can be 
modeled as a two-dimensional projection surrounding 
the drawn object. Stated another way, the object is 
positioned at the center of a large sphere on the inside 
of which a picture of the environment has been painted. 
These simplifications allow the environment to be 
modeled as a two-dimensional intensity map indexed 
by the polar coordinate angles of the ray reflected 
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Fig. 5. Anamorphic image. 





Fig. 6. Fourier synthesis of texture: top shows texture pattern; 
bottom, texture object. 
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Fig. 8. Computer generated reflections: left-hand. side shows environment map; right-hand side shows reflection in teapot. 





(Ar, Yr, Zr). (Such maps will be shown with azimuthal 
angle plotted as abscissa and polar angle plotted as 
ordinate.) 

When a reflection direction is computed, it is con- 
verted to polar coordinates and the reflected light in- 
tensity for that direction is read from the map. This is 
similar to the technique used to map texture onto a 
surface, except that the reflection direction, instead of 
Parametric surface position, determines the coordi- 
nates in the map. Figure 8 shows an image generated 
using these techniques. 

Use of the surface normal alone is tantamount to 
modeling the environment on an infinitely large sphere. 
This has the undesirable effect that the reflected in- 
tensity at all silhouette points on the object is the same, 
and corresponds to the intensity of the environment 
model diametrically opposite the eye. This deficiency 
can be corrected by using both the surface normal 
and surface position to determine what part of the 
environment map is reflected in a given surface 
fragment. 


Combinations of Techniques 


The techniques described above for simulating tex- 
ture and reflection can be combined to produce images 
of objects having patterned shiny surfaces. When high- 
lighting is combined with texture mapping, only the 
component from the real light source should be scaled. 
This models the highlight as being specularly reflected 
at the surface and not being affected by the pigment 
within it. In Figure 9 note how the highlights wash 
Out the texture pattern underneath them. The technique 
for texture mapping actually keys the texture to the 
Surface so that it moves with the object. Some other 
techniques, which essentially apply texture to the 2-D 
image, do not have this property. Note, in Figure 9, 
how the highlights hardly move with the teapot, whereas 
the texture does. 

Given the texture mapping technique and the en- 
vironment reflecting technique, we can combine them 
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to produce an image of a highly glazed patterned teapot, 
as in Figure 10. 


Resource Requirements 


The images shown in this paper were all generated 
on 4 PDP-11,/45 computer having a 256K-byte random 
access frame buffer which was used as the depth buffer. 
The main routines were written in Fortran and the 
critical parts were written in assembly language. The 
computation time of the extended subdivision algo- 
rithm is roughly proportional to the area covered by 
visible objects. Images of nontextured objects of 
the type used in this paper take about 25 minutes. The 
addition of texture or reflection increases this time by 
about 10 percent. All images have a resolution of 
512512 picture elements. 


Conclusions 


By refining and extending Catmull’s subdivision 
algorithm, images can be generated having a far higher 
degree of naturalness than was previously possible. 
These generalizations result in improved techniques 
for generating patterns and texture, and in the new 
capability for simulating reflections. 
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1. Introduction 


1.1 Background 

Early research in computer graphics was concerned 
with the organization and presentation of graphical 
information in the form of real-time line drawings on a 
CRT. Many of the concepts of structuring graphical 
information were developed by Sutherland in Sketchpad 
[19], and the line-drawing graphical displays that re- 
sulted from his early research remain the most widely 
used today. With the development of integrated circuit 
technology, research interests shifted to producing very 
realistic, shaded, color pictures of the visible parts of 
complex three-dimensional objects. Because of the 
desire to utilize television technology, the algorithms 
for producing these pictures generated output for a 
raster CRT. The pioneering works in this area were by 
Schumacker et al. [18] and Wylie et al. [23]. 

Computer produced pictures now provide one of 
the most direct and useful ways of communicating 
with the computer. The ability to produce shaded 
pictures that illustrate mathematical functions and 
physical properties of mathematical models is of in- 
contestable value in both research and education. 
With the development of computer controlled simu- 
lators, a real-time computer displayed environment is 
now used to train pilots of aircraft [11, 16], spacecraft 
[9] and ocean vessels [2]. Other significant uses of com- 
puter pictures include computer aided design [4], 
modeling of chemical structures (22], and computer 
animation {7, 12]. With this increased value of com- 
puter generated pictures, comes an increasing need to 
devise efficient algorithms that improve the realism and 
enhance the descriptive power of these pictures. 


1.2 Motivation for New Research 

The underlying motivation for new research on 
computer produced pictures is to either enhance the 
realism of the pictures or improve the performance of 
the algorithms that generate them. Most recent re- 
search has addressed a combination of these issues. 

There are three basic approaches to improving 
picture quality. The first is to devise clever ways to 
add information value to a scene without significantly 
increasing the total amount of information in the data- 
base for the scene, for example, without increasing the 
number of polygons used in representing the objects. 
Approaches of this type usually make subtle changes 
to the visible surface and shading algorithms that result 
in greatly improved pictures. Examples are the im- 
provements to shading algorithms devised by H. 
Gouraud [10] and Bui-Tuong Phong [15]. 

The second approach is to employ more refined 
mathematical models for the objects being rendered 
and to devise algorithms that can find the visible surfaces 
using these models. The goal of these methods is to 
model smooth surfaces with surface patches, such as 
Coons patches [5] or B-splines [4, 17], rather than with 
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clusters of polygons, and still not increase the size of — 
the database. Catmull’s [3] ingenious algorithm is an 
example of this approach. The benefit of these methods 
is that an arbitrarily refined description of the model is 
present, thus allowing much better renditions of con- 
tours and shading. The disadvantage is that because of 
nonlinear mathematics, the algorithms are less effi- 
cient than polygon-based algorithms. 

The third approach is to increase the information in 
the database and employ more structured methods for 
handling the increased information. The motivation 
for this approach is that the information value of a 
scene grows in proportion to the amount of informa- 
tion in the database for the scene. Newell’s [13] al- 
gorithm is an example of this approach. 

The structured approach appears to be the most 
promising of these approaches since it potentially im- 


_ proves both picture quality and algorithm performance. 


However, there are several problems associated with 
this approach. First, increased complexity of a scene, 
or increased information in the database, has less 
value as the resolution limits of the display are ap- 
proached. It makes no sense to use 500 polygons in - 


describing an object if it covers only 20 raster units of 


the display. How do we select only that portion of the 
data base that has meaning in the context of the resolu- 
tion of the viewing device? Second, how do we ac- 
commodate the increased storage requirements of this 
additional information? We might, for example, wish 
to model a human body to the extent that a closeup 
view of the eye shows the patterns of the iris, yet such 
a fine description of the entire body will indeed require 
large amounts of store. Third, how much information 
must be presented to convey the information content 
of the scene? In other words, we would like to present 


_ the minimal information needed to convey the meaning 


of what is being viewed. For example, when we view 
the human body mentioned above from a very large 
distance, we might need to present only ‘‘specks’”’ for 
the eyes, or perhaps just a ‘‘block”’ for the head, totally 
eliminating the eyes from consideration. The amount 
of information ‘‘needed” can be the subject of psycho- 
logical debate, but it is clear that even coarse decisions 
will yield more manageable scenes than attempting to 
use all of the available information. 

These issues have not previously been addressed in 
a unified way. The research described here represents 
an attempt to solve these and related problems. 


2. Summary of Existing Algorithms 


Visible surface algorithms may be categorized ac- 
cording to whether they employ polygons, parametric 
surface patches, or procedures as the underlying method. 
of modeling the surfaces they render. The most thor- 
oughly studied types of algorithms use polygons. How- 
ever, because of the shortcomings of representing 
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smooth surfaces with faceted clusters of polygons, 
some research interest has recently been devoted to 
parametric surface algorithms, which allow higher 
degrees of continuity than just positional continuity. 
The algorithms for these different modeling methods 
will be discussed separately. 


2.1 Polygon-Based Algorithms 

A highly informative survey of existing polygon- 
based visible surface algorithms has been written by 
Sutherland et al. [20]. As they point out, a convenient 
way to classify these algorithms is according to the 
order in which they sort the image space polygons that 
are potentially visible in a scene. The basic difference 
between the major algorithms is in whether they sort 
in depth (from the viewpoint) before the vertical- 
horizontal sort, or vice versa. 


Depth-first sort. The most significant algorithms to 
use this sorting order are due to Schumacker et al. [18] 
and Newell et al. [14]. Schumacker utilizes this order 
along with a polygon clustering concept to achieve a 
coherence from one frame to the next, while Newell 
utilizes it to render translucent images. By first com- 
puting a priority ordering of polygons according to 
their image space distance from the screen, they are 
able to establish which polygon segments on a given 
scan line have visibility priority. 

Newell uses this information to write those segments 
with a lesser priority into a scan-line buffer before 
writing in those with a greater priority. Thus greater 
priority segments which are from translucent polygons 
only modify the intensity values in the buffer rather 
than completely overwriting them. While there is 
clearly a considerable overhead in writing into the 
buffer segments that might eventually be obscured, 
some beautiful pictures have resulted from this work. 

Schumacker’s goal is to produce real-time picture 
Sequences. Rather than writing the polygon segment 
information for a scan-line into a buffer according to 
Its priority, a set of priority-ordered hardware registers 
are simultaneously loaded with the priority-ordered 
segment information. Then as tiie scan line is displayed, 
the register information is counted down and a com- 
binational-logic network selects the appropriate highest 
priority register according to its lateral displacement 
on the screen. This approach requires a separate set of 
registers for each polygon segment that intersects the 
scan line. Nonetheless, it represents the first real-time 
solution to the visible surface problem [9]. 

There are two very significant features to Schu- 
macker’s work. First, he makes use of a priori knowl- 
edge of the database to compute fixed priorities for 
clusters of polygons. If the polygons in a group of 
polygons are not subject to changes in relative place- 
ment, they form a cluster and may be assigned fixed 
priorities which work no matter from where the cluster 
is viewed. Thus part of the priority ordering is fixed 
with the environment and need not be recomputed 


549 


each frame. Second, he shows that if the environment 
Is restricted so that the clusters are linearly separable, 
an intercluster priority can be established that does 
not change unless the viewpoint crosses one of the 
separating planes; hence, the priority ordering remains 
fixed from one frame to the next unless one of the 
planes is crossed. 

This work by Schumacker and coworkers represents 
the only visible surface algorithm to make use of both 
Structured information (clustering) and frame to frame 
coherence (relatively constant intercluster priority). 
These very important concepts will be discussed in 
more detail later. 


Depth-last sort. The algorithms that use this sort- 
ing order have been devised by Watkins [21], Bou- 
knight [1], and Wylie et al. [23]. They are referred to as 
scan-line algorithms and differ only in their use of 
various image-space coherence properties. All three 
first perform a vertical bucket (radix) sort of polygon 
edges according to their uppermost vertices. Then for. 
each scan line, the various polygon segments on that 
scan line are sorted according to their horizontal dis- 
placements. The depth sort is deferred until last under 
the assumption that the initial two sorts will decrease 
the number of depth comparisons needed to determine 
final visibility. 

Of the three approaches, Watkins’ is the most 
economical because of its uses of scan-line coherence 
and a logarithmic depth search. The assumption of 
scan-line coherence is that in going from one scan line 
to the next, the number of changed polygon segments is 
small; hence the horizontal sort may be optimized to 
take advantage of this. Watkins’ is the only other 
algorithm besides Schumacker’s that has been imple- 
mented in hardware. 


2.2 Parametric Surface Algorithms 

Modeling smooth surfaces with collections of 
polygons leads to problems both in shading the sur- 
face and in rendering the contour edges. While there 
have been a number of very clever improvements to 
the quality of such pictures without significantly in- 
creasing the amount of information used, notably 
those of Gouraud {10}, Phong [15], and Crow [6], the 
most direct approach is to employ a more refined model, | 
such as parametric surface patches. Such patches can 
be used to define the surface using no more, and usually 
even less, information than is required with polygons. 
Yet they can join together with tangent or even higher 
continuity, thus eliminating the above problems. The 
difficulty with this method is that the mathematics is 
no longer linear; to explicitly solve for such things as 
the curve of intersection of two bi-cubic patches or of a 
patch and a clipping plane are very difficult problems. 

Catmull [3} solves such problems, but -not ex- 
plicitly. Rather, he does so by employing the discrete 
character of the image space, a recursive algorithm, 
and what he calls a Z-buffer. For each patch in the 
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environment his algorithm asks: does the patch extend 
over more than a single raster unit? If the answer 1S 
yes, the patch is subdivided (by a very fast algorithm 
for bi-cubic patches) into four patches and the same 
question is recursively asked of these patches. When the 
answer finally is no, an attempt is made to write the 
intensity and depth coordinates for the resulting 
“patch” into a buffer for the raster unit, or pixel, in 
question. The attempt fails if the pixel buffer already 
has in it a depth coordinate nearer to the observer 
(with some minor modifications to allow for translucent 
patches). 

A very significant feature of Catmull’s algorithm is 
that, despite the more complex mathematics, it will 
actually work faster than polygon-based algorithms 
if the object being rendered occupies a very small area 
of the screen. Because of the recursive Structure of the 
algorithm, it will ‘structure’ the surface no more 
finely than the resolution of the display dictates, 
Whereas current polygon-based algorithms keep the 
same structural description, i.e. the same number of 
polygons, no matter how much of the screen area is 
occupied. This notion of Structuring will be extended 
to include polygon-based algorithms in the next section. 


2.3 Procedurally Modeled Objects 

Newell [13] has recently employed procedural model- 
ing to solve the visible surface problem for complex 
scenes. According to this approach, objects are modeled 
using procedures which “know how” to render them. 
selves in terms of their own primitives, which might 
include activations of other object procedures; such 
knowledge includes rendering only their visible parts. 
This is a very general way to represent objects. 

Although the underlying philosophy of this approach 
is very general, in the actual implementation Newell 
user polygons as the basic primitives for the objects. The 
object procedures ure activated according to a priority 
ordering so that more distant objects are activated 
first. Each procedure renders the object it represents by 
activating the Watkins process, the results of which 
are written into a frame buffer. The net result js there- 
fore a “hybrid” Watkins/Newell priority algorithm. 

The significant point about this algorithm is not 
the procedural modeling but that it represents another 
example of structuring to simplify the total sorting 
problem, namely that the geometric primitives of one 
object need be compared with those of another only 
when the objects overlap. 


3. Hierarchical Approach 


It was indicated in the previous section that, aside 
from uses of image-space coherence to reduce the 
amount of sorting required, the most fruitful gains in 
visible surface algorithm research have resulted from 
Structuring the environments being rendered. However, 
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the structures employed take a diverse variety of forms, 
from Catmull’s implicit Structuring of surface patches 
to Newell’s procedural objects. What is needed is a 
single, unified, structural approach that embodies all 
of the ideas from these algorithms. Before presenting 
one such approach it is instructive to consider two 
ways in which structure has been utilized to prepare 
objects for visible surface processing. 


3.1 Existing Uses of Structure 


Defining relative placement. The benefits of a posi- 
tion or motion structure have been realized for some 
time. Sutherland used such concepts in two dimensions 
in Sketchpad, and a number of graphics hardware 
companies incorporate transformation hardware in 
their display devices to accommodate Structural de- 
scriptions. Most of the visible surface algorithms pre- 
sented used a position or motion structure to describe 
positions and orientations of objects relative to each 
other. However, all but the few mentioned in Section 
2 disregard the structure at the visible surface algorithm 
level. That is, all polygons of the objects are trans- 
formed into a common screen coordinate system in 
which the visible surface algorithm works. 

An example of such a structure is shown in Figure 
1. Each node in the hierarchy represents a set of geo- 
metric primitives (e.g. polygons) defining the node and 
the arc leading to the node represents a transformation 
defining the orientation and placement of the node 
relative to its ‘‘parent.” Because each node has its 
own unique transformation defining it, it may represent 
one of many “instances” of the same primitive de- 
scription, or data set. This is a very convenient and 
general way to define and place objects. 


Decreasing clipping time. When simulating a camera 
in a computer-generated environment, some parts of the 
environment must be “clipped” to the field of view of 
the simulated camera. This can be done either by trans- 
forming all of the geometric primitives of each object. 
into the camera, or Screen, coordinate system and clip- 
ping each of them separately or by first clipping some 
bounding volume of the object to see if it intersects the 
boundaries of the field of view. If it does not, then the 
parts of the object lie either totally within or totally 
outside of the field of view and thus need not be sepa- 
rately clipped. This utilization of the above mentioned 
position hierarchy is implicitly assumed, although this 
author does not know if the authors of the various 
algorithms actually made such use of it. 


3.2 New Uses of Structure 


Varying environment detail. By choosing to represent 
an object with a certain amount of detail, one fixes the 
minimum distance from which the object may be dis- - 
played with a realistic rendering. For example, a 
dodecahedron looks like a sphere from a sufficiently 
large distance and thus can be used to model it so long 
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Fig. 1. The traditional motion structure used to position objects 
relative to the “world” and subobjects relative to objects. Each 
arc in the graph represents a transformation. 
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as it is viewed from that or a greater distance. However, 
if it must ever be viewed more closely, it will look like a 
dodecahedron. One solution to this is simply to defirfe 
it with the most detail that will ever be necessary. 
However, then it might have far more detail than is 
needed to represent it at large distances, and in a com- 


plex environment with many such objects, there would | 


be too many polygons (or other geometric primitives) 
for the visible surface algorithms to efficiently handle. 

As mentioned in Section 2, the solution to this 
problem has been to define objects relatively coarsely 
and employ clever algorithms that smooth appropriate 
contours or improve shading to make the object look 
more realistic at close observation. The difficulty with 


these approaches is that at best the range of viewing — 


depth is only slightly improved, and the problem of too 
much detail at large distances usually remains. AI- 
though these approaches have yielded results of un- 
questionable value, it seems evident that multiple levels 
of description must be used to adequately represent 
complex environments.! 

How does one represent these multiple levels of 
description? A solution is to define “objects” in a 
hierarchy like that of Figure 2. The entire environment 
is itself an “object”? and is represented as a rooted tree. 
(“‘Object” is a generic term for the things represented 
by nodes of the tree. This generic term will be used 
for the remainder of this paper.) There are two types of 
arcs in the tree, those that represent transformations 
as before and those that represent pointers to more 
detailed structure (the identity transformation). Each 
nonterminal node represents a ‘‘sufficient”’ description 
of the “object” if it covers no more than some small 


‘Actually, Evans and Sutherland made use of a three-level 
description of the New York skyline in its Maritime simulation, 
but in an ad hoc way [2]. 
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area of the display; the arcs leading from the node 
point to more detailed ‘‘objects” which collectively 
define a more detailed version of the original object if 
its description is insufficient because it covers a larger 
area of the screen. The terminal nodes of the tree repre- 
sent either polygons or surface patches (or other primi- 
tives) according to whether they are primitive elements 
of a faceted or a smooth object. 

As an example of such a description, consider a _ 
model of the human body. When viewed at a very large 
distance, for example when the body covers only 3 or 4 
display raster units, it is sufficient to model the body 
with a single rectangular polyhedron with appropriate 
color. Therefore the uppermost node, or “object,” for 
this body represents this simple description. If the body 
is viewed from a closer distance—for example, if its. 
topmost node’s description covers 16 raster units— 
then this topmost description is no longer sufficient, 
and the next level of more refined description is needed. 
At this next level the body is now perhaps described as 


a collection of rectangular polyhedra appropriately 


attached to each other, for example using one poly- 
hedron for each of the arms and legs, the head and the 
torso. Then so long as each of these ‘objects’? covers 
only a few raster units of the display, their description 
is ‘‘sufficient.”” When the viewing distance decreases 
such that any of them covers a critical maximum area 
of the display, its more detailed subobjects are used to 
replace its description. This process is carried out to 
whatever maximum level of detail will be needed. For 
example, a terminal level of description of the fingertip 
might be several surface patches (which could be im- 
plicitly structured even more finely using Catmull’s 
algorithm). 

The body described is just one ‘‘object” of an en- 
vironment, or larger hierarchy. There might be many 


such bodies, or other objects. The significant point, 


however, is that in a complex environment, the amount 
of information presented about the various objects in 
the environment varies according to the fraction of the 
field of view occupied by these objects. 

It is worth noting again that Catmull’s algorithm, 
described in the previous section, implicitly built such 
a structure. His algorithm used this structure in such a 
way that, despite the more complex mathematics of 
surface patches, it outperforms polygon-based algo- 
rithms if the surface occupies a small area of the screen. 
Thus it seems that such a structure should lead to im- 
provements in polygon-based algorithms as well. 


Clipping: a truncated logarithmic search. The choice 
of this structural representation poses another problem. 
How does one select only that portion of a potentially 
very large hierarchy that is meaningful in the context of 
the viewpoint and the resolution of the viewing: device? 
In other words, clipping in a broader sense must mean 
selecting not only that part of the environment within 
the field of view (the usual meaning) but also just the 
resolvable part. This implies finding the visible nodes of 
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the tree, as shown in Figure 2. The contour shown in the 
figure represents a possible set of objects that are within 
the field of view and are both not too large and not too 
small for the screen areas they occupy. 

In order to efficiently perform this clipping opera- 
tion some minimal description of object sizes must be 
available. For example, a bounding rectangular box or 
a bounding sphere would be sufficient information to 
test whether an object is totally within or totally outside 
of the field of view. The minimum necessary informa- 
tion is the center and radius of a bounding sphere. 

This general structure therefore suggests a very 
fast clipping algorithm which recursively descends the 
tree, transforming (if necessary) this minimal informa- 
tion into perspective viewing coordinates and testing 
both the area occupied by the bounding sphere and its 
intersection with the boundaries of the field of view. 
The criterion for descending a level is the area test, 
while the criterion for inclusion ‘rejection 1s the field 
of view boundary test. Only after either the area test 
terminates the descent or the terminal level of repre- 
sentation is reached is it necessary to actually trans- 
form and possibly clip the polygons or surface patches 
represented by the node. Clipping therefore resembles a 
logarithmic search that is truncated by the area (re- 
solvability) test. | 

This relatively simple mechanism for varying the de- 
tail in a scene suggests several other interesting possi- 
bilities. Since the center of attention of a scene Is often 
its geometric center, one might effectively render the 
scene with a center-weighting of detail. In other words, 
the maximum area an object is allowed to cover before 
splitting it into its subobjects becomes larger towards 
the periphery of the field of view. This 1s somewhat 
analogous to the center-weighted metering systems of 
some cameras. Likewise, since moving objects are less 
resolved by both the human eye (because of saccadic 
suppression) and a camera (because of blurring), one 
can render them with an amount of detail that varies 


inversely with their speeds. Indeed, an entire scene _ 


might be rendered with less detail if the camera is 
moving. Thus ‘clipping’ can be extended to include 
these concepts as well. 


Graphical working set. Since the problems addressed 
by this model are those associated with producing 
pictures and picture sequences of very complex en- 
vironments, the excessive storage needed for the geo- 
metric description of these environments must some- 
how be accommodated. Denning’s ‘‘working. set” 
model for program behavior provides a useful analogy 
[8]. According to this model, a computer program that 
makes excessive demands on immediate-access store 
is structured or segmented, and its storage demands 
are managed in such a way that only those segments 
most recently in use are actually kept in immediate- 
access store. The remaining potentially large number of 
segments are kept on a slower, secondary store, such 
as a disk. The “working set” is that set of segments 
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Fig. 2. A very deep hierarchy that structures the environment 
much more than the traditional motion structure. Arcs in this 
graph represent either transformations or pointers to more re- 
fined definitions of the node. The visible part contour represents a 
possible result of clipping. 
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available for immediate access, and is usually defined 
by a time average of past program reference patterns. 
Reference to an unavailable segment causes that seg- 
ment to become part of the working set, and segments 
not accessed after some pericd of time are deleted 
from the working set. 

This working set model coupled with.the broader 
sense of clipping mentioned above suggests a suitable 
way to accomplish a particular type of frame coherence. 
The working set in this context is that set of objects in 
the hierarchy that are ‘tnear’’ to the field of view, inside 
it, or “near” to the resolution of the image space. 
Only if an object is a member of this set is its description 
kept in immediate-access store. The set membership 
will change slowly since the differences between one 
scene and the next are usually small. Those cases in 
which the differences are large due to fast camera (or 
object) motion are easily accommodated by rendering 
the scene (or object) with less detail, as mentioned 
above. Moreover, the minimal description of node size 
needed for clipping suffices as the graphical analog of 
the segment table used in the computer program con- 
text. That is, this minimal clipping description must. 
always be available in immediate-access store to facili- 
tate determining the working set. This working set 
model therefore seems particularly well suited to the 
graphics context. 
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Improving existing algorithms. There are two ways 
in which a geometric hierarchy should lead to improve- 
ments in existing algorithms. The first is by reducing the 
number of comparisons needed to sort objects and the 
second is by eliminating from potential consideration 
an entire portion of the environment because an object 
obscures it. 

Since sorting is the central problem of visible surface 
algorithms, the performance of these algorithms im- 
proves with improved sorting methods. Indeed, many 
of the fast visible surface algorithms that have been 
discussed have resulted from clever utilization of image- 
Space coherences, such as scan-line coherence, to im- 
prove sorting speeds. In the present hierarchical frame- 
work, the geometric proximity of the subobjects of an 
object provides an object-space coherence that can also 
be utilized to decrease sorting time. 

For example, consider an ideal case of a binary tree 
as shown in Figure 3. Each node of the tree has as- 
sociated with it a bounding volume, but since this 
ideal tree is the result of clipping, only the terminal 
nodes actually represent geometric primitives, e.g. 
polygons or patches. Assuming that there are n levels 
in the tree, not counting the root node, there are m = 
2" terminal nodes. 

If the structure is ignored, then the fastest possible 
sort of these terminal nodes is accomplished with pro- 
portional to m log. comparisons using a quicksort. 
However, if the structure is utilized and if the bounding 
volumes of siblings do not overlap, which is admittedly 
an optimum arrangement, then the number of required 
operations is p2° for the first level, p2! for the second 


Fig. 3. An ideal binary hierarchy in which none of the terminal 
nodes overlap. The first p2° comparison sorts all objects into two 
classes, the second p2' comparisons sort them into 4 classes, etc. 
Summing all comparisons from all levels yields p(2" — 1) 
comparisons, 

ideal) Ensironment Number of 


comparisons 
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level, p2? for the third, etc., where p is a proportionality 
factor. Summing the number of Operations performed 
at all levels yields pd 729 2! = p(2" — 1), or roughly 
pm. In other words, by using the Structure, in the 
optimum situation of no overlap, the sorting time grows 
linearly rather than as m logem. 

Of course, this analysis holds only for a binary 
hierarchy in which none of the siblings’ bounding vol- 
umes overlap, which is an idealized situation. A binary 
hierarchy might not be appropriate, and any complex 
environment will no doubt have some overlap, although 
presumably not a very large amount. However, the 
point here is that sorting methods which utilize the 
geometric structure can yield a considerable per- 
formance improvement over those which do not, even 
under less than ideal conditions. 

The other improvement provided by a deeply 
Structured geometric hierarchy is that of eliminating a 
potentially large part of the structure from considera- 
tion because an object obscures it. Such an improve- 
ment requires defining for each object (in the generic 
sense) both a simple occluded volume, A, such that if 
A is obscured then the entire object is obscured, and a 
simple occluding volume, 5, such that if 6 obscures 
something then that thing is sure to be obscured by the 
object. Clearly, A exists for all objects, whereas 6 might 
not exist for some objects, such as an open-ended cyl- 
inder or a transparent object. A can be just the bounding 
sphere used in clipping, but 6 is in general additional 
information that must be kept for each object. 


Recursive descent, visible surface algorithm. - The 
above considerations suggest a totally new recursive- 
descent visible surface algorithm in which at each level 
all objects are sorted according to their bounding 
volumes. If any of the bounding volumes overlap both 
laterally and vertically then the occlusion test potentially 
allows one (or more) of the objects, and hence all of 
its descendents, to be totally eliminated from con- 
sideration. 

Using the ordering thus obtained, the same sorting 
and occlusion tests are recursively applied to each of 
the descendants of these objects; in those cases where 
two or more objects’ bounding volumes overlap in 
all three dimensions, indicating potential intersections, 
the descendents of these objects are treated as if they 
have the same parent nodes at the next level of recur- 
sion. Of course, recursion terminates when a terminal 
node is reached, and the net result of descending the 
tree is a very rapid sort of the primitives represented 
by these terminal nodes. Under ideal conditions, the 
computation time of this algorithm grows linearly with 
the visible complexity of the scene. | 

Since both this algorithm and the clipping algo- 
rithm described above recursively descend a tree struc- 
ture, it seems natural to combine them. Doing so not 
only potentially eliminates area tests on occluded ob- 
jects but also potentially decreases the size of the work- 
ing set. If all processing is performed by a single pro- 
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cessor, such as a general purpose computer, then the 
algorithms are probably most conveniently integrated 
into a single algorithm. However, if multiple processors 
are available, whether special purpose hardware or 
general purpose computers, then the algorithms might 
be left separate or combined according to whether 
parallelism is achieved by pipelining or otherwise. 


Building structured databases. Obtaining a good 
graphical database is a very time consuming and diffi- 
cult part of computer picture research. Databases 
obtained by careful measurement of real objects, by 
“building” objects from collections of simple mathe- 
matical objects, or by sculpturing surfaces in three di- 
mensions [4] are at least as valuable as the visible 
surface algorithms that render them. | 

At first glance it appears that the structural frame- 
work multiplies the dimensions of this problem since 
multiple descriptions of the same object must be de- 
fined. However, in the case of carefully measured real 
objects, the multiple descriptions can be produced by 
judicious “bottom-up” pruning of existing definitions 
of the objects in their most detailed form. Therefore use 
can be made of all objects that have already been 
defined. 

Those existing objects modeled with surface patches 
also present no problem. The coarser, high-level de- 
scriptions of these objects can be obtained by replacing 
the patches themselves with polygons and proceeding 
with the “bottom-up” pruning mentioned above to 
obtain even coarser descriptions. The finer, low-level 
descriptions of the objects can be obtained by ‘“‘top- 
down” splitting of the surface patches, as in the Cat- 
mull algorithm. This can be done either at display time 
or beforehand in building the database; the difference 
is the traditional time ‘space tradeoff. 


4. Conclusions 


-All of the recent major advances in computer pic- 
ture research have resulted from either explicitly or 
implicitly incorporating structure information in the 
geometric modeling techniques. This research represents 
an attempt to encompass all of these advances in a 
more general structural framework as a unified ap- 
proach to solving a number of the important problems 
of systems for producing computer pictures. 

_The proposed hierarchical models potentially solve 
a number of these problems. They provide a meaningful 
way to vary the amount of detail in a scene both ac- 
cording to the screen area occupied by the objects in the 
scene and according to the speed with which an object 
or the camera is moving. They also extend the total 
range of definition of the object space and suggest 
convenient ways to rapidly access objects by utilizing 
a graphical working set to accomplish frame coherence. 

An important aspect of the hierarchical models is 
that by providing a way to vary detail they can yield 
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an incremental improvement to existing systems for 
producing computer pictures without modifying their 
visible surface algorithms. Another incremental im- 
provement is then possible by incorporating the struc- 
ture in the sorting phases of existing algorithms. A 
final improvement is suggested by a totally new re- 
cursive descent visible surface algorithm in which the 
computation time potentially grows linearly with the 
visible complexity of a scene rather than as a worse 
than linear function of the object-space complexity. 
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An algorithm for drawing pictures of three-dimen- 
sional objects, with surfaces made up of patches of 
quadric surfaces, is described. The emphasis of this 
algorithm is on calculating the intersections of quadric 
surfaces. A parameterization scheme is used. Each 
quadric surface intersection curve (QSIC) is represented 
as a set of coefficients and parameter limits. Each 
value of the parameter represents at most two points, 
and these may easily be distinguished. This scheme can 
find the coordinates of points of even quartic (fourth- 
order) intersection curves, using equations of no more 
than second order. Methods of parameterization for 
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1. Introduction 


This paper presents an algorithm for computer 
generation of orthographic and perspective drawings 
of three-dimensional objects whose surfaces are made 
up of segments of quadric surfaces, also known as 
quadric patches. 

There are now many algorithms for generating 
pictures of solid objects with surfaces consisting of 
planar polygons. A polygon is a segment of a first- 
order surface (plane) bounded by other first-order 
surfaces. The Braid algorithm [1] extends this to allow 
cylinders and parts of cylinders. Cylinders are quadric 
surfaces. | 

This paper is partially based on the Woon algo- 
rithm [11], 12], which processes second-order (quadric) 
surfaces bounded by second-order surfaces. Quadric 
surfaces include such familiar objects as spheres, 
cylinders, cones, and hyperbolic paraboloids. They are 
not too complex mathematically. 

An important advantage of using quadric patches ts 
that an object whose surface is approximated by many 
small polygons may be better modeled by a few larger 
quadric patches. Another advantage is that shaded 
pictures may be made with better control over such 
effects as mach banding and specular reflection [9]. 

Previous quadric patch algorithms, such as those 
by Mahl [6] and Weiss [10], as well as the Woon algo- 
rithm, run into difficulty in solving for the intersection 
of two quadric surfaces. This is a fourth-order problem, 
and there is no easy method for finding roots of a 
fourth-order equation, let alone discovering if there are 
any real roots. 

The algorithm presented here reduces these fourth- 
order problems to second-order problems. The quad- 
ratic equation formula, familiar from high school 
mathematics, provides a rather simple method to solve 
these equations. 

In addition, this algorithm uses a parametric repre- 
sentation. Every point on the intersection of two 
quadric surfaces is determined uniquely by a numerical 
parameter and, in many but not all cases, by a Boolean 
sign parameter. Since these fourth-order quadric sur- 
face intersection curves (QSICs) can be represented 
parametrically, the computer memory need only store 
about a score of numbers to represent a QSIC, rather 
than storing the coordinates of dozens of points for 
each QSIC. 


2. Theory—Quadratic Form 


2.1 Discriminant 
The quadratic surface, as represented in eq. (1) 
below: 


g(x,y, Z) = gx? + gy? + quz* + gixy + qsyz 
+ gezx + rx t+ gay t gz+tq=O (1) 
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is a quadratic form [3, 8, 11] which may be represented 
in vector matrix form, as follows. 

Suppose the location of a point in 3-space is repre- 
sented as the following vector: x = (x yz 1). Equation 
(1) may be represented as 


g(x, y, Z) = q(x) 


Qi 291 346 241 x 
1 l 
errs a | 241 G2 945 24s y ? 
ay 296 393 Ys 249 Z 2) 
597 348 299 4o I 
or, for short, | 
q(x) = xQx" | (3) 


where Q is the 4X4 matrix in eq. (2). Q is called the 
discriminant of the quadric surface. Note that Q is 
symmetric, and that, for any real nonzero scalar 8, 


that BQ is equivalent to Q, since they describe the same 


surface. In this paper, the same symbol may be in- 
discriminantly used for both a quadric surface and its 
discriminant. 

This author often prefers to use an alternate form: 


ADFG 
Q = DBEH 
“NEF EC J]? 
GH K, 

such that 


q(x, y, Z) = Ax? + BY + C2? + 2Dxy + 2Eyz + 2F zx 
+ 2Gx + 2Hy + 2/z+ K 
= 0. 


The upper-left 33 principal submatrix of the dis- 
criminant contains all the second-order terms. This 
will be called the ‘‘subdiscriminant’’: 


9. 241 346 ADF 
Qu = (291 42 749} =( DBE}. 
296 245 qs FEC 


The subdiscriminant will always be represented 
with the subscript w. 

The rank of the discriminant and the subdiscrimi- 
nant are helpful in classifying quadric surfaces. These 
are invariant under the transformations outlined be- 
low. In fact, the rank of the discriminant is invariant 
under any nonsingular transformation, and the rank 
of the subdiscriminant is invariant under any trans- 
formation in which the upper-left 3X3 submatrix is 
nonsingular. 


2.2 Transformations of the Discriminant 

In this paper, there will usually be two coordinate 
Systems. In setting up the object, the untransformed 
space will be the uvyw space; and the transformed space 
will be the xyz space. The transformation matrices 
will be F and %, each being the inverse of the 
other. 

First, let us consider the transformation: x = uF, 
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Table I. Guide to Classification of Quadric Surfaces. (The 
imaginary part is in parentheses.) 








d m = § Conditions Surface 
Singular planar 
} 0 0 invalid 
| I J coincident planes 
2 0 O° single plane 
Zz | ] D,>0 invalid (imaginary parallel 
planes) 
2 l ! D, <0 two parallel planes 
2 2 7, <0 two intersecting planes 
2 2 2 I, >0 line (two intersecting 
imaginary planes) 
Singular nonplanar 
3 ] | parabolic cylinder 
3 2 0 TT, <0 hyperbolic cylinder 
3 2 2 T,> 0; 7,.D3 <0 elliptic cylinder 
3 2 2 7T,>0; 71D; > 0 invalid (imaginary 
cylinder) 
3 3 l a cone | 
3 3 3 p point (imaginary cone) 
|Qui |Q| os Conditions Surface 
Nonsingular 
0 + 0 (T, < 0) hyperbolic paraboloid 
0 —- 2 (T, > 0) elliptic paraboloid 
+ + 1 a hyperboloid of one sheet 
+ — | a hyperboloid of two sheets 
+ + 3 pp. invalid (imaginary 
ellipsoid) 
+ —-— 3 B ellipsoid 





d= 4; m = 2 if det(Q.) = 0, m = 3 if det(Q,) #0 
O denotes zero, + denotes nonzero. 





+ denotes positive, — denotes negative; 

a: Ty, > 0; det(Qu) x N < 0 or T, < 0. 

3: T, > 0; det(Q.) x T; > 0. 
Gu Giz fis Gis ADFG ADF 

os 9u 92 Ga Gi)  {[DBEH Q.-|DBE 
G31 G32 G33 Ga FECJY? ‘7 FEC 
Gai Gaz Gaza Qaa GHIJIK 


d = rank(Q); m = rank(Q.); 8 = abs(signature(Q,)), 


3 
N= Vai =A+tBH+C, 
rai 


2 

















cs ae Git Giz 
ea Vii Qi; 
= AB+ AC + BC — D? — FE? — F?*, 
3 4 
D, zs ~ ae qi qi; 
a=] peer + qi qj; 
=+N)K-G@— H— J, 
Gii Gij Qik 
2 3 4 
Ds = 2, 2 os Git Qi Vie 
ab perth kw p+ 
Qki Gkj Gkk 
= ABC + ACK + ABK + BCK + 2(DEF + FGJ + DGH 
+ EHS) —- D(C+ K) —- B(A+ K) ~ FP(B4+ K) 
- G(B+ C) — B(A+ C) -— JHA H+ B). 
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where x = coe 1),u = (uvw 1), and 


Tein fee0 
Pe ee eee 
fai fon 20 
6, 6, 6. | 


F is a congruence transformation. The subdis- 
criminant F, is an orthogonal transformation. If F, 
is orthonormal, then the determinants of both F, and 
F are unity. 


Assume that § = F'. &, is the transpose of F, if 
they are orthonormal, but the bottom row of § bears a. 


more complicated relationship. The lower-right ele- 
ment of $ (the constant term) is unity. 
Suppose the equation for the quadric surface P is 


uPu' = 0 
XPx = Q 


in untransformed space, and (4a) 
in transformed space. (4b) 


Using the transformation x = uF on eq. (4b), we 
have uF@F’u’. Comparing this with eq. (4a), we have 
P = FOF’. 

By pre- and post-multiplication, we have: 


5 PE" = FFPF'S’ = 19] = &, or 
P= FPS", 


It 1s possible to break down the transformation 
into two parts: rotation and translation. They may be 
applied repeatedly in any order. 

The rotation matrix is 


tea fis0 
Fxtivt 0 
i 0 
0 0 0 1 


R= 


The subdiscriminant is affected only by rotation. 
Rotation does not affect the constant term, in the lower- 
right corner of the discriminant. 

The translation matrix is 


1000 

0100 
P= 100° 10 
6, 6,46. 1 

Translation does not affect the subdiscriminant. 
The constant term is affected only by translation. 

A transformation can be formed by a rotation fol- 
lowed by a translation (F = RT), or by a translation 
followed by a rotation (F = TR). T and R do not 
usually commute, Just as rotation and translation do 
not usually commute. 


2.3 Canonical Form 

It is possible to put the discriminant of a surface 
into a canonical form, in which the axes of the co- 
ordinate system are the axes of the object, with non- 
zero diagonal elements of the discriminant being given 
precedence. Basically, this is done by diagonalizing 
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Table II. Guide to Classification of Conic Sections. 














dm ss Conditions Curve 
Singular 

1 Oo QO invalid 

I ] ] coincident lines 

2 0 O single line 

2 #1 i D,>0 invalid (imaginary 
parallel lines) 

2 1 | D. <0 two parallel lines 

2 2 #O | Q.| <0 two intersecting lines 

2. 2 2 1|Q.| > 0 point (intersecting 

imaginary lines) 
Nonsingular 

3 1 | parabola 

3 2 O | Qu} <0 hyperbola 

3 2 2 {Q|>0,|QO[% <0 ellipse 

3 2 2 |Q.|>0,{Q|%>0 — invalid (imaginary 


ellipse) 


Gi Giz G3 ADG AD 
Q= Gugd2g3})=|{DBH ; ee 
31 G32 G33 GHK 


d = rank(Q), m = rank(Qu), 8s = abs(signature(Q,)) 
MN =4n+42=A+B 


2 3 ; 
D,= > yD 2%) 2 p+ BK + AK — D?-~ G — Bt. 
t= 1 gaat. | Qi Qyj 











the sub-discriminant, and then eliminating, by con- 
gruence transformations, as many of the other elements 
as possible. For more details, see [4]. 


2.4 Discriminant Form for Conic Sections 

The conic sections are quadric curves, being also 
of the quadratic form. The discriminants of a conic 
are 3X3 matrices, and the subdiscriminants are the 
2X2 upper-left submatrices of the discriminants. 
Other than the reduction in the number of dimensions, 
the conic sections are completely analogous to quadric 
surfaces, except that the transformation matrices are 
slightly simpler: 


fx fay 0 KSO 
F=(fu fy 0) =| —-S KO 
a ae 6, 5, | 


where K? + S? = |. 


2.5 Classification of Quadric Surfaces and 
Conic Sections 

Tables I and II are guides to classification of quadric 
surfaces and conic sections, respectively. These are 
adapted, in part, from a chart in (3, p. 230}. 

One of the columns in these tables is the absolute 
value of a signature. The signature of a matrix is the 
number of positive eigenvalues minus the number of 
negative eigenvalues. 


October 1976 


Communications 
of Volume 19 
the ACM Number 10 





3. Classification of Quadric Surface Intersections 


When one has two quadric surfaces, the first two — 


questions that arise are the following: 

1. Do they intersect? 

2. If so, what is the nature of their quadric surface 
intersection curve (QSIC)? 


3.1 Pencil of Two Quadric Surfaces 
Suppose we have two quadric surfaces, with dis- 


criminants P and Q. In matrix form, the equations | 


for the two surfaces are xPx’ = 0 and xQx’ = 0. 
The equation, x(Q — aP)x' = 0, represents, for all 
real values of @ (finite or infinite), a surface on the 
‘‘nencil” of P and Q. | 

For a = 0, we have the surface Q. Fora = +, 
we have the surface P.’ 

If surfaces P and Q intersect, then their intersec- 
tion (QSIC) is the “base curve’’ of the pencil, and it lies 
in all the surfaces of the pencil. This “base curve” is not 
to be confused with the ‘“‘base curve” of a parameteri- 
zation surface, which will be used extensively later on. 

If the two surfaces do not intersect, then none of the 
real surfaces of the pencil intersect. In addition, the 
pencil contains some imaginary surfaces, and these 
may be among those listed as invalid in Table I. 

The general form for a member of the pencil of P 
and Q is given by R(a) = Q — aP. 


3.2 Classification of Pencils 

One may classify pencils by the classification of the 
simplest surface in the pencil. If the pencil has a base 
curve, then the base curve (a QSIC) has the same classi- 
fication as its pencil. 

If, for some a, R(a) has rank one or two, then 
R(a) represents either a plane or pair of planes. This 
pencil is called ‘‘planar.’? Failing this, if, for some a, 
R(a) has rank three, the pencil is ‘nonplanar singular’. 
If R(a) is never singular, then it always has rank four, 
and the pencil is nonsingular. 

One can tell if two surfaces do not intersect if one of 
the following occurs: (1) for some value of a, R(a) 1s 
invalid, or (2) either P or Q does not intersect some R(a@). 


3.3 Determining if a Surface Intersection is Planar 
or Singular 

If, for some value of a, R(a) has rank of two or less, 
then the pencil is planar. As Woon has pointed out 
[11, pp. 34-36], this happens when two conditions are 
met for the same value of a: (1) Det(R(a)) = 0, and 
(2) the sum of the 3X3 principal minors of R(a) 
vanishes. Thts may be expressed as: D3(R(a)) = 0. 
If condition (1) is met but condition (2) is not, then 
the surface is nonplanar singular. 


3.4 Nonsingular Pencils 
If R(a) is never singular, then there is an R(a) which 


is a hyperbolic paraboluid. See the Appendix for proof 
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that the intersection between any two quadric surfaces 
lies in a ruled quadric surface. The crux of this al- 
gorithm is that, in calculating points on a QSIC, one 
uses the parameter to select a line in the ruled surface, 
and then solves for the intersection of the line and 
another quadric surface. This is at most a second-order 
problem; the technique is outlined in Section 4.3. 


4. Theory of Parameterization 


In this section, methods of parameterization are 
discussed. The parameterization is done in a uvw co- 
ordinate system. A congruence transformation is then 
used to transform point coordinates into the xyz 
system. The parameter 1s called /. 


4.1 Parabola 

Suppose we have a parabola of the form: Au? + 
2Hv = Q. Taking m = —A/2H, we have: vy = mu’. 
The parameter 1s f. Therefore we can say 


u=t, v= mp, 


4.2 Ellipse 

Suppose we have an ellipse of form Au, + By, + 
K = 0. Taking r, = (K/A): andr. = (—K/B)? as 
being the semi-axes, we have w?/r.27 + v?/r,2 — 1 = O, 
where we take 
_ 2r ge, te 
aa he ite ee 
(This is a variation of a ‘well-known’ parameteriza- 
tion mentioned in [1]). | 

This form is to be used only for —I1 < ¢ < +1. 
This gives only positive values of v. Section 4.4 below 
gives a more complete method. 

This parameterization is well behaved for values of 
t within the range [—1, +1]. That is, if ds = (du? + 
dv?)? ds/dt does not vary too much if r, and r, are 
of the same order of magnitude. 


ul 


4.3 Hyperbola 
Suppose we have a hyperbola of the form Aw? + 
BY + K = 0. Takingr, = (K/A)' andr, = (= K/B)*: 
we have 
2 
2t ee eae 


ae a 


for —1 < ¢ < +1.A hyperbola has two disjoint parts. 
This parameterization gives only one of them, in which 
v > 0. 

In contrast to the parameterization of the ellipse, 
this form is not well behaved. At values of ¢ approach- 
ing --1, small changes in ¢ result in large changes in 





1 Since BP represents the same surface as P for any real non- 
zero scalar 8; if 8 = 1/a, lima., (1/a)(Q — aP) = lime., (Q/a) 


— P = —P, which is equivalent to P. 
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u and v. However, at these places, the hyperbola is 
very close to its asymptotes, and is practically a straight 
line. 


4.4 More Complete Forms 

For central conics (ellipses and hyperbolas), the 
following modifications may be used: 

Take as the parameter ’, which takes values from 
—2. to +2., inclusive, for ellipses. (For hyperbolas, 
the values —2, 0, and +2 are excluded.) 

For ¢ > 0, we have tf = ¢ — | and o = +1; 

For ¢ < 0, we have ¢ = 7 + | andog = —1. 

For ellipses, we have 


u a. r ' a r 
—_— oC Ue Bae oe, ii 2, —= C ie pcre ae 
I+ Pf °° 1+? 

For hyperbolas, we have 
ne, cel _ ite 
ee Oa ree Vee eae 


Since, for an ellipse, « and v should be periodic 
functions, we can set up a parameter 1”, which is nor- 
malized to f by adding or Subtracting multiples of 
four, such that /’ is in the range |—2., +2.]. 


4.5 Nonplanar Intersections 

The parameterization of planar intersections is 
described above. For nonplanar intersection, the curve 
lies in a quadric surface which has a “base curve” 
which is either a line or conic section. For a cylinder, 
this base curve is a cross-section of the quadric sur- 
face, in a plane perpendicular to the main axis. There 


iS at least one set of straight lines, one line passing. 


through each point of the base curve, such that each 
line lies wholly in the quadric surface, and every point 
on the quadric surface lies on one of these lines. 

One uses parameterization to select a point on the 
base curve, and its corresponding line. By solving a 


quadratic (second-order) equation, one can then find: 


the intersection of the line with any other quadric sur- 
face. In this manner, all the points of the QSIC(s) 
may be found. 


3. Information Needed by the Algorithm 


Special note. The following algorithm has not been 
implemented, and is presented only as a guide to future 
implementation. | 

The algorithm needs the following input information 
in some form. 

(1) For each object, it needs to know the dimensions 
of the “object box,’’ which is a cube or rectangular 
solid in which the object is contained; and the resolution 
and vector lengths which will be used. 

(2) Surface equations must be known for each sur- 
face. These may be presented either as ten coefficients 
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(in the form of eq. (1)), or presented as follows: (a) the 


_ type of surface (ellipsoid, hyperboloid, cylinder, etc.), 


(b) the lengths of radii, semi-axes, etc., (c) the orienta- 
tion and displacement from the origin. An interac- 
tive input may be accepted in which the user may 
specify a surface, and then manipulate and distort it to 
sult. 


(3) Bounds must be specified for each quadric 
patch in the same manner used in the Woon algo- 
rithm. A ‘“‘quadric patch” is defined as the locus of 
points on the surface satisfying a certain Boolean con- 
dition, which is specified for each surface. Each “bound” 
consists of another quadric surface, and a polarity 
(+ or —). If the polarity is positive (+), the bound is 
satisfied (true) only for points in the exterior of the 
bounding surface. If the polarity is negative (—), 
the bound is satisfied only for points in the interior 
of the bounding surface. Each “set of bounds” con- 
sists of one or more bounds. The set of bounds is 
Satisfied iff all bounds in the set are satisfied. Each 
patch has one or more sets of bounds. The patch is the 
locus of all points on the surface satisfying at least one 
set of bounds. 


(4) Surface intersections must also be specified. 
For each one, the user must tell the algorithm the two 
intersecting patches, the multiplicity of the intersection 
(how many disjoint parts it has), and if it is to be a 
“smooth” or a ‘‘sharp”’ intersection. 

A “sharp” intersection abruptly separates two sur- 
faces, so that there is always an edge which may be 
seen, if viewed from a proper angle. It is appropriately 
displayed in the drawings. ‘“‘Smooth” intersections 
are used when several quadric patches are used to ap- 
proximate a single higher-order surface. These are not 
included in drawings, unless they occur along limbs. 
Often, the first derivatives will be continuous across a 
smooth intersection. If one wished to approximate a 
torus (donut) by using patches of ellipsoids, hyper- 
boloids of one sheet, and cones, smooth intersections 
would be used. 3 

A patch is usually bounded at surface intersections. 
One usually specifies intersecting surfaces as bounds 
for each other. However, if there is a smooth inter- 
Section with a continuous first derivative, then it might 
be difficult to tell if a point is on one side of a boundary 
or the other. Therefore the program should automati- 
cally compute, for every smooth intersection, another 
Surface from the pencil of the two intersecting surfaces. 
This surface will act as an auxiliary bounding surface. 
It should meet both intersecting surfaces at a high 
angle. | 

Surface intersections are usually also surface bounds, 
but other surfaces may be needed to completely describe 
the bounds of a patch. If two surfaces intersect-in two 
distinct circles in parallel planes, an additional auxiliary 
Surface may be needed to distinguish the two. For a 
more detailed discussion, see [11, pp. 9-13]. 
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6. Processing a Surface Intersection 


The following is a brief description of the processing 
needed for each quadric surface intersection curve 
(QSIC). Please refer to [4] for more details. 


6.1 Classification of QSICs 

First, one should compute all the real roots of the 
equation: det(R,(a)) = det(Q, — aP.) = 0. If, for 
any such a, R(q) is invalid, then there is no intersection. 
If, for any such a, R(a) is singular, then we may have 
a planar intersection. Otherwise, if one of the R(a) is a 
hyperbolic paraboloid, we use that R(q@). If not, there 
must be some value of a for which R(a) is singular, and 
therefore the surface R(a) is a cylinder. In any case, we 
choose the ‘‘simplest’” R(a), which is the one closest to 
the top of the following list: 


Planar line (imaginary intersecting planes) 
single plane ; 
coincident planes 
parallel planes 
intersecting planes 

Nonsingular hyperbolic paraboloid 

Nonplanar __ parabolic cylinder 

singular elliptic cylinder 


hyperbolic cylinder 
~The R(a) that is selected is the “parameterization 
surface’. Also, one must select some other member of 
the pencil (usually P or Q) to act as the “other” surface 
in the parameterization scheme. 


6.2 Handling of Planar QSICs | 

If the parameterization surface 1s a /ine, then the 
QSIC consists of the line, or one or two points on the 
line. Include only those parts within bounds for both 
intersecting surfaces should be included. 

If the parameterization surface is a single plane, 
the QSIC, if it exists, is a conic Section. The parametert- 
zation appropriate to that conic section should be used. 
In the xyz coordinate system, this may be represented 
as 


x = (a. + bt + ¢.)/6 + de, 7 (Sa) 
y=(a4+ b6,t4+ c¢,)/64+d,, (Sb) 
z=(a.P+b.t+ c¢.)/6 + d.. (Sc) 


Using this form means that the transformation 
from uvw space to xyz space need be done only once, 
to set up the coefficients of eqs. (5). 

If the parameterization surface is a pair of planes, 
it is factored into two separate planes, each handled 
as above. 


6.3 Handling of Nonplanar QSICs 

For a cylinder, the ‘tbase curve’ of a nonplanar 
parameterization surface is the cross-section of that 
surface in a plane perpendicular to that surface’s axis. 
For a hyperbolic paraboloid, this 1s one of the straight 
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lines which lie on the paraboloid. 

For a given parameter, one finds the coordinates 
of the corresponding point on the base curve. These are 
called xo, yo, and 2. 

One then finds the equation for the line in the para- 
meterization surface including point (x0, yu, Zo). This 
is of the form 


X = Xo + Ye, (6a) 
y= prt Ys, (6b) 
z= 2+ 7:3, (6c) 


where s is the Secondary parameter. 

One then substitutes the values of x, y, and z found 
in eqs. (6) into the equation for the other surface. The 
result is a Second-order equation of the form 


as’ + bs + ¢ = 0. (7) 
The ‘‘quadratic discriminant” of this equation 1s 
= b? — 4ac. | (8) 


There are at most two solutions to this equation. One 
is used when the parameter is increasing, and the other 
when it is decreasing. This way, each point on the QSIC 
is represented by a single numerical parameter, plus a 
Boolean ‘‘sign”? parameter. 

For a cylinder, we get the y’s (direction cosines) as 
follows: yr = @:, yy = e,, and y- = e.. The coefh- 
clients are computed as (e, e, e:) = (00 1)5, . 


6.4 Parameterization of QSICs Lying in 
Hyperbolic Paraboloids — 

This is a particularly easy form to handle. First, 
a combination of congruence and scaling transforma- 
tions is used to put the hyperbolic paraboloid in the 
form 


0 | 

i 0 
0 0 
0 0 
This is equivalent to the equation: uv = w. The ‘‘base 
curve” is the uw-axis, and the regulae, or straight lines 
lying wholly in the paraboloid, are parallel to the v-axis. 
This is expressed asu = f, v = s,andw = st. 

The x, y, and z coordinates are expressed as 


x=ct+d,+es + fist, 
yr=ettd, + es + f,st, 
z=citt+d,+e.s + f.st, 
with 

Cr: 562-0 

e, e, e, O ae 

fi fy Ff: 9 

d, d, dz} 


where ¥ is the transformation matrix from the uyw 
system to the xyz system. 
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6.5 Parameter Limits 

For a planar QSIC, there is a two-member parame- 
ter limit string of the form (n, 2). The ‘basic limits” 
are (—2., +2.) for an ellipse. For a paraboloid or 
hyperboloid, they are determined by the limits of the 
object box. The “bounded limits” are of the same 
form, but only include those portions of the QSIC 
within bounds for both intersecting surfaces. This may 
be done by tracing through the QSIC, and finding the 
places where the ‘‘bounding state’ changes. Interval 
halving may be used to refine these values and find the 
bounded limits. There may be more than one such 
pair of limits for a QSIC. The number of such pairs is 
the QSIC’s ‘‘multiplicity.” 

For nonplanar QSICs, the limit string has three 
elements, in the form (1, t%, ts). Each adjacent pair, 
(4, &) and (f, ts), represents a parameter trace in 
one direction. If ¢, < ¢,,1, then the quadratic equation 


(7) is solved with a positive radical, as » = (—b + 
*)/2a. If t; > trar, we use a negative radical, 
$= (02D) 2ae IP i = 1 NO tracing is done. 


‘D is the discriminant of eq. (7), as in eq. (8).) Using 
this form, we may completely trace a nonplanar QSIC, 
each time obtaining a unique point on the curve. 

For ‘basic limits,” we select only those ranges in 
which the lines corresponding to points on the base 
curve lie partly in the object box, and the value of 9 in 
eq. (8) is non-negative. | 

For ‘bounded limits,’ we take those ranges of the 
parameter satisfying the bounds of both intersecting 
Surfaces. When tracing a QSIC, we use only the values 
of the parameter within the bounded limits. 


7. Limbs 


Since quadric surfaces are generally not flat, it is 
likely that, from a particular viewpoint, a quadric 
Surface may ‘‘fold in back of itself.”” The locus of points 
where this happens is the limb. This terminology is 
due to Comba [2]. A virtual edge is a segment of a limb. 

More precisely, the limb is the locus of points on a 
Surface where the normal to the surface is perpendicu- 
lar to the line of sight. These points must satisfy both 
of the following conditions: 


g(x, y,z) =Oand p= S-grad g = 0, 


Where s is the line of sight vector from the object 
point to the viewpoint; and grad g is the surface nor- 
mal, which may be expressed as the following column 
vector: 

grad q = (2x + qiy + gez + qr 
2q2.¥ + qix + sz + qs 
2qsZ + gex + Gay + Qs 


For a more detailed discussion of limbs for ortho- 
graphic and perspective projections, see [4]. 


561 


8. Final Processing 


This algorithm is primarily designed to handle 
QSICs, with hidden curve elimination a secondary 
consideration. | 

Final processing is a combination of hidden curve 


determination and drawing of the curves. These may 


be done simultaneously. A “brute force’? method is 
used to determine hidden curves. Each point is tested 
against every other patch which, as determined by an 
envelope test, might possibly hide the point. Each 
point may be displayed as soon as it is determined that 
it is not hidden. 

If a shaded picture is desired, one may set up 


“display parameter limits,” much like the other parame- — 


ter limits. For each scan line, one determines the in- 
tersections of the scan line and the projections of the 
QSICs. For each point in between these intersection 
points, one may compute the line of sight, the normut! 
to the quadric patch, and the distance from the patch 
to the viewpoint. These may then be used to compute 
Shading. It may be possible to express all these as a 
single function, thus reducing computation time. 
Pictures of transparent objects may also be generated. 
For more details on shading and transparency, see 
[9] and [7], respectively. 


9. Importance of Algorithm 


The chief benefits of this algorithm are: 


(a) It is a complete quadric surface algorithm, allowing 


use of all real quadric surfaces. 

(b) It allows use of “smooth” intersections to approxi- 
mate higher-order surfaces. 

(c) It may be fast enough to use in conjunction with a 
shading algorithm. | 
Since this algorithm distinguishes limbs and sharp 


and smooth intersections, each may be appropriately 


handled by the shading algorithm. This may include 
explicit handling of the Mach Band effect, Specular 
reflection, and transparency. 


Appendix 


THEOREM. The intersection of two quadric surfaces 
lies in a plane, pair of planes, hyperbolic or parabolic 
cylinder, or hyperbolic paraboloid. 

Definitions: 

A para is a quadric surface with a Singular sub- 
discriminant. An elliptic para is an elliptic cylinder or 
an elliptic paraboloid. A nonelliptic para 1s, by exclu- 
sion, a plane, a pair of planes, a hyperbolic or parabolic 
cylinder, or a hyperbolic paraboloid. ; 

A rotation transformation is a transformation on 
the discriminant matrix representing a rotation. It 
does not affect the constant (zero-order, or lower-right) 
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element. A trans/ation transformation is a transforma- 
tion on the discriminant matrix representing a rectili- 
near translation. It does not affect the elements of the 
Subdiscriminant. A scaling transformation is the dis- 
tortion of the axes by multiplying each by a linear 
factor, and is represented by a nonsingular diagonal 
matrix. It does not affect the quality of a surface, in 
that the surface remains of the same type (an ellipsoid 
remains an ellipsoid), but it does distort its distances. A 
valid transformation 1s a combination of one or more 
of the above. Notice that multiplying each element of 
the discriminant by the same nonzero constant does 
not affect the surface at all. | 

LEMMA 1. The intersection of two arbitrary quadric 
surfaces lies in a para. | 

Case |. Either one of the surfaces is a para. The 
hypothesis is satisfied. 

Case 2. Let us call the surfaces P and Q. The equa- 
tion, det(R,(a)) = det(Q, — aP,) = 0, may be written 
as: 


—det(P,)a + Kia’ — Kya + det(Q,) = 0, (1) 


with K, and K; being the sums of the determinants of 
combinations of columns of P, and Q, . Because 
neither P nor Q is a para, der(P,) # Oand det(Q,) # 0. 
Therefore, eq. (1) 1s definitely of third order, so there 


must be at least one real root in w. For this a, Ra) 1S, 


by definition, a para. 
LEMMA 2. Roots of the equation, 


det(Q, — aP,) = 0, (2) 
are unaffected by any valid transformation applied to both 
P and Q. | 


PROOF. Suppose we have the valid transformation 
S and its upper-left 3X3 orthonormal submatrix S, , 
both of which are nonsingular. Transforming both P 
and Q by S gives us 


det(S,OS,' — aS,PS,"') = det(S.(Q — aP)S,") 
= det(S,)det(Q — aP) 

-det(S,"*) 

det(Q — aP) = 0, 


which obviously has the same roots as eq. (2). 


LEMMA 3. The subdiscriminant of an elliptic para 


and another arbitary quadric surface may be represented 


as 
100 ADO 
010) Q,= {DBE 
O£ 


000 Cc 


P, = 


by using valid transformations. 

PROOF. Suppose we have an elliptic para P and an 
arbitrary quadric surface Q. Rotate the para so its 
subdiscriminant is in diagonal form: 


$00 ADF 
P,” =(0T 0], ST>O0; QO.” =(DBE}. 

00 0 FEC 
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If S, 7 < 0, multiply S and T by —1. 
Now, if we transform P,” and Q,.” by the following 
scaling transformation: 


Ss? O OV 
OO: apeteoy, 
0 0 1 


we get 
100 ADF 

P, ={010) O.o =| DBE 
000 


jae wa OY 


Then apply the rotation transformation 


L —-M 0 
M L O 
0 Qa | 


with L? + M? = land L/M = E/F, to get 


100 ADO 
P,={010]), O, =|DBE 
000 OEC/. 


COROLLARY 1. The intersection of an elliptic para 
(P) and an arbitrary quadric surface (Q) with C = 0, 
lies in a nonelliptic para. 

PROOF. First, take 


ADO A0 0 
R, = QO, — AP, =|(DBE)]—{(0A40 
0 EQ 000 
0 D QO 
= |D B—A E}, 
0 E QO 
which means that 7,(R) = —D? — E? < Q. 


Case 1. If D = E = 0, then 7.(R) = 0, and 


00 O 
R, =(0 B-A 0), 


0 0 0 


msl. 


Therefore R is a plane, a pair of coincident or parallel 
planes, or a parabolic cylinder, all nonelliptic paras. 
Case 2. If D # O or E # O, then 7,(R) < 0. R 
cannot be an elliptic para, so it must be a nonelliptic 
para. | 
_ COROLLARY 2. An intersection between an elliptic 
para and another quadric surface, with C # 0, can be 
represented as 


100 ADO 
P,={010), Q.=|[DBE}, 
000 0 £1 


if one simply divides every element of Q by C. 
Now, with R(a) = Q — aP, we can get the 
equation : 


det(R(a)) = —0e + 0° — (B- E+ Aja 
+ (AB — AE’ — D’) = 0. 
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Taking 8 = B — E,, we have 
det(R(a)) = a — (A+ B)at+ (AB — D’) = 0. (3) 


Note that the quadratic discriminant of this equation 
is‘ = (A — B)? + 4D? > 0, so eq. (3) has at least 
one real root. 

LemMa 4. The intersection of two elliptic paras lies 
in a nonelliptic para. 

Proof. By Lemma 3, we can take 


100 ADO 
P,={010], Q. ={|[DBE}. 
000 OEC 


If C = 0, then the hypothesis 1s satisfied by Corol- 
lary |. Therefore, by Corollary 2, we can take C = I. 
For Q to be an elliptic para, we must have 


det(Q,) = AB — AE’ — D’ = 0, and (4) 
T(Q) = AB+A+B—D'— E> 0. (5) 


Taking 8 = B — E’, we have 


det(Q,) = AB — D’ = 0, or 
AB = D, > 0 


T(Q) = AB+ A+B AB-E 


=AIL+E)+8>0 


(condition a) 


(condition b) 


Now, find a such that R(a) = Q — aP isa para. This 
a can be found by taking eq. (3) with 48 — D? = 0. 
The two roots are a = 0, A + 8B. Rejecting the first 
root, we havea = A + 8B, or . 


—p  (AB)' 0 
R(A + B) ={(AB)' B-AE 
0 E l 
For R(A + 8), 
T, = —|A+ Bl + F)). (6) 


Assume R is an elliptic para, then T)(R) > Q, and 
therefore | 


A+ p(1 + E*) < 0. (condition c) 


It is obvious that conditions a, b, and c are contra- 
dictory. Therefore R cannot be an elliptic para, and 
so it must be a nonelliptic para. 

THEOREM. The intersection of two quadric surfaces 
lies in a nonelliptic para, these being a plane, pair of 
planes, hyperbolic or parabolic cylinder, or hyperbolic 
paraboloid. 

Proor. Assume that there are two quadric surfaces, 
designated Q, and Q,. If either is a nonelliptic para, 
then the hypothesis is satisfied. If both are elliptic 
paras then the hypothesis is satisfied by Leruna 4. 

Otherwise, at least one of the surfaces must be a 
non-para. Call this Q. By Lemma I, there must be a 
para in the pencil of Q: and Q,. Call this P. If P isa 
nonelliptic para, the hypothesis is satisfied. Otherwise, 
P is an elliptic para. 

By Lemma 3, P, and Q, may be expressed as 
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ADQO 
DBE 
O EC 


100 
P,=j;010]}], Q= 
000 


If C = 0, the intersection is contained in a non- 
elliptic para by Corollary 1. By Corollary 2, if C ~ 0, 
then we can set C = 1, and 


det(R,(a)) = a — (A+ B)a+ (AB — D*) = 0, or (7) 


a = A+B + ((A — BP + 4D%))). (8) 
Casel. A= B= B— EF; D = 0; thusa = A. 
A QO 0 00 0 
O,={0 A+EFEE), RA) = [0 & E}. 
0 E 1 OE | 


R(A) is either a plane, a pair of planes, or a parabolic 
cylinder, all of these being nonelliptic para’s. 

Case 2. One of the roots in a is zero. This implies 
that QO isa para. If Q isa nonelliptic para, the hypoth- 
esis is satisfied. If Q is an elliptic para, then it 1s sat- 
ished by Lemma 4. 

Case 3. A #* B or D # O. Here, the quadratic 
discriminant of eq. (7), (which is ® = (A — B)? + 4D?) is 
positive. Therefore these equations have two distinct 
roots. If either R(a) is nonelliptic, the hypothesis is 
satisfied. If either is elliptic, the other must be non- 
elliptic by Lemma 4, thereby satisfying the hypothesis. 
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A significant increase in the capability for control- 
ling motion dynamics in key frame animation is achieved 
through skeleton control. This technique allows an 
animator to develop a complex motion sequence by 
animating a stick figure representation of an image. 
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'Cel has been derived from celluloids, the material on which 
drawings are prepared in conventional cel antmation. Component 
images that move separately are usually drawn on separate cels 
and stacked into a cel sandwich for filming. 
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Introduction 


Previous work has demonstrated that key frame 
animation techniques constitute a successful approach 
to animation of free-form images [1-3]. Using this 
technique, the artist draws key images at selected in- 
tervals in an animation sequence and the playback 
program computes the in-between images by interpola- 
tion. Interpolation between related key images allows 
the animation of change of shape or distortion. It 
permits a direct and intuitive method for specifying the 
action, whereas mathematically defined distortion re- 
quires trial and error experimentation. One strength 
of key frame animation techniques ts the analogy to 
conventional hand animation techniques, simplifying 
the transition when a classically trained animator 
adapts to using computers. 

Figure | illustrates a typical image sequence gen- 
erated using key frame animation. The first and last 
images were drawn by the artist, while the six inter- 
mediate images were selected from the 240 frames in 
the actual film sequence. 

The animation package is implemented on a mini- 
computer based interactive graphics system. This pack- 
age supports the four major phases of a production: 
(1) the drawing phase, (2) the assembly of drawings 
into key frame sequences, (3) the preview /modification 
phase, and (4) the final processing and recording of 
the sequences on film. , 

The drawing phase is carried out in two stages. The 
first stage is off-line at the drawing board. Analysis of 
the action depicted by the story board establishes key 
positions from which drawings are prepared. The 
second stage involves tracing these drawings on a 
graphic tablet at the display console. During this 
stage, the order in which strokes are traced to describe 
an image 1s important. Since the interpolation process 
is based on stroke to stroke mapping, this ordering of | 
strokes between related images controls the form of 
the intermediate image. 

The second phase consists of the interactive assem- 
bly of individual drawings or cels' into key frames, 
including a specification of the interpolation law for 
each cel and a key to key time interval. Concatenated 
key frames form a sequence. This process is repeated 
for all concurrent sequences that make up a composite 
sequence. 

During the preview phase, playback of any indi- 
vidual sequence or concurrent sequences on. the inter- 
active display permits an assessment of the resulting 
animation. Modification involves returning to the in- 
teractive assembly phase to edit the sequences. In. 
practice, direct playback assures only that the form 
of the interpolated images can be assessed, since it 1s 
difficult to achieve playback at the cine rate with’ 
complex images. Proper assessment of motion and 
timing requires further conversion to a raster format 
which maintains display at the cine rate independent 


Communications October !976 
of . Volume 19 
th: ACM Number 10 





of the image content. 

This technique of animating free-form images has 
been directed mainly towards drawn images and 
hence two-dimensional. The image material, of course, 


attempts to represent a 3-D space much as in conven- 


tional cel animation. The basic capability includes a 
simplified solution to the problem of hidden surfaces 
by treating the image as a hierarchy of parallel planes. 
The simplification lies in the fact that the animator- 
specified order of planes establishes the order of visi- 
bility computation, thus eliminating any programmed 
sorting of data by depth. The composite playback facil- 


ity produces separately a composite line image with | 


hidden lines removed and a composite surface sequence. 


Consideration of Motion Dynamics 


The greatest shortcoming in key frame animation 
results from incomplete control of motion dynamics, 
both in complexity and in smoothness or continuity. 
It is relatively simple to have good control over the 
dynamics in time. The amount of change from one 
frame to the next is determined by a weighting factor 
which is a single-valued function of time. Thus one can 
easily compute, or store precomputed, various func- 
tions representing different “tapers.” However, the 
same value of weighting function is applied to an entire 
picture component. There is no ‘spatial weighting.” 

The shortcomings manifest themselves in the fol- 
lowing ways: (a) the motion of each point in the image 
is along a straight line and the relative change from 
one frame to the next is the same for all points belong- 
ing to one picture element, and (b) there is a discon- 
tinuity at key frames in both the amount of frame to 
frame change and in the direction of apparent motion. 
Therefore it is difficult to synthesize smooth continuous 
motion spanning several key positions. There is a 
dilemma in that smoothness is achieved by having as 
few key images as possible (and therefore widely spaced 
in time), while close control requires many closely 
spaced keys. In addition, a large number of closely 
spaced drawings negates much of the economic ad- 
vantage of using computers. 

Various techniques have been examined for over- 
coming these problems. One technique provides an 
ability to include a rotational component as part of 
the image change, which in effect superimposes rotation 
on the interpolation process {2}. This permits some 
variation in the spatial dynamics but its application is 
limited and discontinuities at key frames remain. 

Synthesis of complex motion could be achieved by 
using additional intermediate keys, but preparation of 
additional drawings by the animator is uneconomical. 
Skeleton techniques were developed to derive variations 
of existing key drawings to be used as intermediate 
keys [2|. This involves representation of a drawing by 
a simple skeleton and then extracting a distorted form 
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of the image by modifying only the skeleton. Even 
when such additional keys are used, discontinuity in 
motion is difficult to avoid. 

Another technique involves the use of smooth 
drawn paths to control the interpolation process. 
Motion along a path, as a method distinct from key 
frame animation, has been used extensively in com- 
puter animation [4-7]. A single path to control inter- 
polation between key images offers a limited solution 
somewhat equivalent to the use of rotation—it tends 
to be satisfactory only when the distortion of the 
image is minimal. For a distorting image, different 
portions must follow entirely different paths. This 
immediately leads to a problem if several paths are to 
be drawn for different portions of the image. It is 
difficult to establish points of simultaneity on several 
paths such that one could easily perceive the shape of 
the image at any instant. | 

An examination of the methods used in conven- 
tional animation has led to a solution to this problem. 


Fig. 1. Selected frames from a key frame animation sequence. 
The first and last images are drawn, the intermediate images are 
interpolated. Multilayer visibility is included in computing the 
composite image. From ‘‘Visage,” a film by Peter Foldes 
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To visualize a complex movement, the animator often 
sketches stick figure representations at equal-time 
intervals between key positions. He may use smooth 
curves through related skeletal points as a further 
guide. This set of stick figures achieves both objectives: 
the frame to frame spacing conveys the rate of move- 
ment and the shape of each skeleton represents the 
shape of the object at that instant. Thus the problem 
reduces to animating a stick figure representation of the 
image which will in turn impart the movement to the 
actual image sequence. 

The system described in this paper incorporates the 


use of skeletons into the key frame technique to pro- 


vide overall control in the playback process. As in the 
basic key frame animation system, the process of 
producing a sequence involves two steps. The first is 
the interactive stage at which the animator prepares 
the key images and establishes the stick figure repre- 
sentations at as many intermediate positions as desired. 
The intermediate skeletons define intermediate control 
keys. During playback the program selects those 
image. components that are skeleton driven and applies 
the necessary deformation. 

There are two significant aspects of the skeleton 
driven technique. First, the skeletons are simple images 
composed of only a few points, so that it is possible to 
provide a high level of interaction. The second aspect 
of this technique is its compatibility with basic key 
frame animation. Skeleton sequences are prepared 
only where necessary and the playback system identi- 
fies those image components that are skeleton driven 
and those that are not. 

It should be noted that the concept of skeletons 
used in the context of this paper differs from that used 
by Blum [8]. Blum’s skeletons are used for image 
replesentation in a compressed form and are derived 
automatically from the coordinate data. Our skeleton 
representation of an image provides a definition of 
some coordinate space within which the image, de- 
scribed in relative coordinates, is distributed. 


Skeleton Coordinate System 
The nature of the coordinate space that is used to 


define relative skeleton coordinates may be thought of 
as a network of polygons that form a mesh (Figure 2). 
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Each polygon has a relative coordinate range of 0 to 
1.0 along each axis. Now the nodes in this mesh may 
be displaced relative to one another to change its 
geometry. However, because the relative coordinate 
system within each polygon is based on its geometry, 
coordinate values remain continuous across common 
edges between adjacent polygons. Thus any image 
whose coordinates are defined within this system will 


‘take on the overall distortion exhibited by the coordi- 


nate space (Figure 3). 


EOE 
| 


a 


wa 
ais 








Geometric transformation of contours from one coordi- 
nate space to another ts, of course, well known in con- 
formal mapping. 

The notion of skeleton contro! implies a central 
core of connected ‘‘bones” with a surrounding image 
distribution. In order to restrict the transverse dis- 
tance away from the core over which skeleton control 
will be active, delimiting boundaries must be specified. 
Consequently, the practical form of skeleton coordinate 
Space spans two units in width, but may extend in 
length as desired (Figure 4). 





For convenience, the central core always represents the 
L axis, which is also the W = 0 coordinate reference; 
the delimiting boundary which is specified first is the 
positive or W = 1.0 boundary, the other is the W = 
~— 1.0 boundary. The ZL coordinate range starts at L = 0 
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and is incremented by one for each node on the central 
core. If desired, the L coordinate space may be sepa- 
rated at any coordinate boundary by providing a 
redefinition of that coordinate boundary before con- 
tinuing the coordinate space. Of course, the related 
image will not normally continue through such a 
separation. In addition, ambiguities can occur if sepa- 
rated coordinate spaces overlap. In general, it is prefer- 
able to treat these as separate skeletons so that no 
restrictions are imposed. On the other hand, any given 
image need not fall entirely within the coordinate 
space of a skeleton. Those points which lie outside the 
skeleton space will remain unaffected by the distortion 
of the skeleton coordinate space. 

Relative coordinates, denoted by (I, w), may be 
defined as the fractional distance along each axis which 
is occupied by a line passing through the point while 
intersecting the two opposing edges of the polygon at 
this fractional distance. In Figure 5(a), the coordinates 
of point P are (0.75, 0.5) by this definition. In order to 
minimize the computation involved in coordinate con- 
version, the simpler definition of Figure 5(b) is used. 





(a) (b) 


Given the absolute vertex coordinates of the polygon and 
an image point P, the fractional distance of points P; 
and P, along lines P;P, and P3P; is expressed by J, giving 


. Xs = U(x, = MAP As. Vere Ky. ~ ya) + yi, 
Ux, — x3) + x3, ye = Kyi — ys) + ya, 


These expressions are substituted into (x — x6)/(y — Ye) 
= (x, — x)/(); — y), the equation of the line P;P. 


passing through point P, giving 
[x — xs — Ux. — xs)/ly — ys — Kya — vad] 
= [x — x — Ux, — m)\/[y — ni — Le — ys))- 


l 


X6 


This reduces to 
(BH — DF)P + (CF —~ DE — AH — BG)I 
+ (AG — CE) = 0 


where 

A=x-—x,, E=x—-— xs, 
B=x2.—-— x, Fo= x3 — X3, 
C=y-y, G=y- ys, 


D=y,—-y, H= yy — ys, 


The desired root for J has a value between O and I, 
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‘the other root will be negative or greater than 1. The 


w-coordinate, expressing the fractional distance of 
point P along line P;P., is given by 


(x — X5)/(x6 — xs) 
(A — BI/[A ~ E — (B — FYI). 


w 


I 


To convert relative coordinates back to display coordi- 
nates, the L-coordinate is applied to the vertex coordi: 
nates to determine the coordinates of P; and Ps from 
which P is found. 

The effect of skeleton control is to take any specified 
area of the display plane and distort it into another area 
of the display plane as if it were made up of rubber 
Sheet patches. In that sense, it is similar to the distorted 
raster scan technique used in Caesar [9], and equivalent 
to the mapping of images into curved surfaces de- — 
scribed by Catmull [10]. While the skeleton coordinate 
Space 1S distorting, however, the relative coordinates of 
the image itself may be undergoing a change. Relative 
coordinates may be treated in the same way as absolute 
coordinates, as if the reference coordinate space was 
always uniform and orthogonal, its particular shape 
being important only for display purposes (Figure 6). 
Therefore the key frame interpolation process may 
still be carried out even if key images are represented in 
relative coordinates. It is this compatibility with key 
frame animation that makes the skeleton control tech- 
nique so powerful and attractive. No other practical 
techniques have been developed that offer a compar- 
able degree of image control in computer generated 
animation. 


Implementation within Key Frame Animation 


The benefits that may be derived in practice from 
Skeleton control are closely related to the method of 
implementation. While there is little doubt that any 
capability for enriching motion is useful, it is equally 
clear that the compulsory use of skeleton control for 
all parts of an animation sequence would be a great 
hindrance. The full advantage in the use of this tech- 
nique is realized only if the animator can apply it 
selectively. In fact, it is most attractive if it can be 
used to improve motion dynamics of sequences which 
were previously created. This is the form in which it 
has been implemented in our system. 

Component sequences are first assembled in the 
usual form and displayed as a composite image se- 
quence for previewing. If, after assessment, the ani- 
mator wants to modify or improve parts of it, he does 
so by adding skeleton control to those components 
only. This is accomplished by attaching a reference 
skeleton, which he has drawn, to each image which 
will be controlled. These image components, which are 
referenced to skeletons, are converted to relative co- 
ordinates and tagged during assembly of the sequence. 
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Now the composite sequence is regenerated in this 
modified form. During playback, all coordinate data 
pass through the interpolation process, but those 
identified as being relative must be mapped to a par- 
ticular skeleton reference for display. A skeleton ref- 
erence defines a display space in absolute coordinates. 
These skeleton coordinate references for each frame 
are provided by assembling stick figure control se- 
quences which are also played back as part of the 
composite sequence. 

The design of the skeleton control sequence itself 
is developed interactively in a separate package. Stick 
figure representations of key images provide the starting 
point (Figure 7(a)). Two such skeletons are used to 
define a start and end frame. When the INBETWEEN 
display mode is active, intermediate frames are in- 
terpolated and presented on the screen as a superposi- 
tion of many frames (Figure 7(b)). For convenience, 
the delimiting boundaries about the skeleton core are 
eliminated from display to prevent excessive clutter. 
Any frame may now be selected and modified using 
tablet interaction. In this mode, the modified coordi- 
nates of the selected frame are stored as a control 
frame within the sequence. The interpolated inter- 
mediate frames adjust accordingly in response to tablet 
interaction (Figure 7(c)). Additional frames may be 
modified in a similar manner (Figure 7(d)). 

Frame to frame change is easily related to the 
spacing between stick figures. This interaction con- 
tinues, giving the animator control over motion dy- 
namics down to the. frame level as in conventional 
animation, if desired. The user-modified control frames 
are preserved, whereas all other intermediate stick 
figures are recomputed when needed. Display of the 
final sequence of control frames is shown in Figure 
T(e). The skeleton boundaries have been adjusted 
where required through similar tablet interaction to 
maintain their desired form. Alternatively skeleton 
keys which have been drawn in these desired positions 
may be brought in from the picture library and as- 
sembled as control frames in the same way. 

In practice, the number of control frames that are 
used to generate a motion sequence will be kept to a 
minimum. Because of this, simple linear interpolation 
between specified control frames may not adequately 
reproduce a smooth continuous movement. This result 
is illustrated in Figure 7(f), where the dynamics of the 
movement suffer from excessive discontinuities in 
rate at two of the control frames. Additional inter- 
mediate frames have been interpolated and plotted for 
clarity. | 

This deficiency is removed if a smoothing function 
is applied during computation of intermediate frames 
(Figure 7(g)). This process maintains continuity of 
movement of corresponding points through successive 
contro! frames. The parametric method of curve fitting 
is adapted from the work of Akima {I1]. Not only does 
it produce a smooth path for each point, but progression 
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Fig. 6. (a), (c) two drawn images in absolute coordinates; (b), 
(d) same images with reference skeletons; (e), (f) the relative co- 
ordinates presented on an orthogonal coordinate system. 





Fig. 7. Development of a skeleton control sequence: (a) start and 
end frames; (b) interpolated inbetweens presented for interaction, 
(c), (d) frames 5S and 9 modified in turn by the animator (control 
frames); (e) final sequence of control frames, shown with bound- 
aries; (f) linearly interpolated inbetweens; (g), (h) with curve 
smoothing; (i), (j) images driven by the motion sequence. 
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along each path is tapered to accommodate changes 
in rate in a complex movement. Successive positions of 
several points have been joined in Figure 7(h) to em- 
phasize the effect. This process has all the character- 
istics of drawing smooth paths to control interpolation 
between images without any of its limitations. 

Now the skeleton control information is complete 
for driving the original image sequence through the 
desired movement (Figure 7(i)). It may equally well 
be applied to drive any other compatible image se- 
quence through the same motion. In Figure 7(j), al- 
though the image sequence itself specifies a HOLD 
during that interval, the skeleton sequence drives it 
through the same motion cycle. The relationship 
between a start/end frame skeleton pair and the num- 
ber of key images in that interval is arbitrary. It should 
also be clear that the interpolation rate between key 
Images is independent of the progression of its skeleton 
through a movement. Because of this, the smoothing 
process may span as many key images as required to 
complete a continuous movement. 

Various program features assist the animator in 
developing a skeleton control sequence. Direct viewing 
of skeleton movement is obtained by requesting the 
ANIMATE display mode at any time. This causes the 
sequence of intermediate frames from the start to the 
end frame to be continuously cycled at the cine rate 
instead of being presented as a static ensemble. Com- 
panion skeletons that have been developed for several 
cels may be previewed together in this mode as well as 
during interaction. If the INBETWEEN display presen- 
tation contains confusing frame to frame overlap 
(e.g. walking on the spot), a positional offset may be 
introduced into successive control frames to remove 
the ambiguity. Similarly any number of intermediate 
frames may be skipped to simplify the overall display 
during interactive modification. 


Proposed Extensions to the System 


Because final processing of skeleton control se- 
quences is performed in the composite playback 
program, the same camera control commands that 
apply pan, ult, and zoom to an image sequence can be 
applied to the control sequence. With an extension of 
the system, more complex processing functions could 
be applied to the contro! skeleton. This approach may 
be useful for superimposing complex forms of move- 
ment control on the skeleton. It may also significantly 
reduce the processing time for rotation since only the 
Skeleton coordinate references need to be rotated. 

Another useful extension deals with the capability 
for creating a library of common movements. If se- 
quences of control frames are saved in a normalized 
form, they can be retrieved and superimposed on any 
particular skeleton as a starting point for developing 
variations of that movement. Since the details of the 
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image itself are not contained in the skeleton, this 


necessitates only that the form of this Skeleton match 


the standard normalized form (i.e. it consists of the 
same connection of bones). All the physical character- 
istics of the particular skeleton being used (such as 
relative length and width of each section) will be re- 
tained while only the stored motion characteristics 
will be transferred. Although this capability has not 
been implemented in the present system, it does indeed 
offer an important potential reduction in animation 
production costs. 
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1 ABSTRACT 


This paper describes how evolutionary techniques of variation and 
selection can be used to create complex simulated structures, tex- 
tures, and motions for use in computer graphics and animation. 
Interactive selection, based on visual perception of procedurally 
generated results, allows the user to direct simulated evolutions in 
preferred directions. Several examples using these methods have 
been implemented and are described. 3D plant structures are grown 
using fixed sets of genetic parameters. Images, solid textures, and 
animations are created using mutating symbolic lisp expressions. 
Genotypes consisting of symbolic expressions are presented as an 
attempt to surpass the limitations of fixed-length genotypes with 
predefined expression rules. It is proposed that artificial evolution 
has potential as a powerful tool for achieving flexible complexity 
with a minimum of user input and knowledge of details. 


2 INTRODUCTION 


Procedural models are increasingly employed in computer graphics 
to create scenes and animations having high degrees of complexity. 
A price paid for this complexity is that the user often loses the 
ability to maintain sufficient control over the results. Procedural 
models can also have limitations because the details of the pro- 
cedure must be conceived, understood, and designed by a human. 
The techniques presented here contribute towards solutions to these 
problems by enabling “evolution” of procedural models using inter- 
active “perceptual selection.” Although they do not give complete 
control over every detail! of the results, they do permit the creation 
of a large variety of complex entities which are still user directed, 
and the user is not required to understand the underlying creation 
process involved. 

Many years ago Charles Darwin proposed the theory that all 
species came about via the process of evolution [2]. Evolution is 
now considered not only powerful enough to bring about biological 
entities as complex as humans and consciousness, but also useful 
in simulation to create algorithms and structures of higher levels 
of complexity than could easily be built by design. Genetic algo- 
rithms have shown to be a useful method of searching large spaces 
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using simulated systems of variation and selection [5, 6, 7, 23]. 
In The Blind Watchmaker, Dawkins has demonstrated the power 
of Darwinism with a simulated evolution of 2D branching struc- 
tures made from a set of genetic parameters. The user selects the 
“biomorphs” that survive and reproduce to create each new gen- 
eration [3, 4}. Latham and Todd have applied these concepts to 
help generate computer sculptures made with constructive solid 
geometry techniques [9, 28]. 

Variations on these techniques are used here with the emphasis 
on the potential of creating forms, textures, and motions that are 
useful in the production of computer graphics and animation, and 
also on the potential of using representations that are not bounded 
by a fixed space of possible results. 


2.1 Evolution 


Both biological and simulated evolutions involve the basic con- 
cepts of genotype and phenotype, and the processes of expression, 
selection, and reproduction with variation. 

The genotype is the genetic information that codes for the cre- 
ation of an individual. In biological systems, genotypes are nor- 
mally composed of DNA. In simulated evolutions there are many 
possible representations of genotypes, such as strings of binary dig- 
its, sets of procedural parameters, or symbolic expressions. The 
phenotype is the individual itself, or the form that results from the 
developmental mules and the genotype. Expression is the process 
by which the phenotype is generated from the genotype. For ex- 
ample, expression can be a biological developmental process that 
reads and executes the information from DNA strands, or a set of 
procedural rules that utilize a set of genetic parameters to create a 
simulated structure. Usually, there is a significant amplification of 
information between the genotype and phenotype. 

Selection is the process by which the fitness of phenotypes is 
determined. The likelihood of survival and the number of new 
offspring an individual generates is proportional to its fitness mea- 
sure. Fitness is simply the ability of an organism to survive and 
reproduce. In simulation, it can be calculated by an explicitly de- 
fined fitness evaluation function, or it can be provided by a human 
observer as it is in this work. 

Reproduction is the process by which new genotypes are gen- 
erated from an existing genotype or genotypes. For evolution to 
progress there must be variation or mutations in new genotypes with 
some frequency. Mutations are usually probabilistic as opposed to 
deterministic. Note that selection is, in general, non-random and 
is performed on phenotypes; variation is usually random and is 
performed on the corresponding genotypes [See figure 1]. 
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phenotype can be saved for further manipulation. Solid polygo- 
nal branches can be generated with connected cylinders and cone 
shapes, and leaves can be generated by connecting sets of periph- 
eral nodes with polygonal surfaces. Shading parameters, color, 
and bump textures can be assigned to make bark and leaf surfaces. 
These additional properties could also be selected and adjusted us- 
ing artificial evolution, but due to the longer computation times 
involved to test samples, these parameters were adjusted by hand. 
In some cases, leaf shapes were evolved independently and then ex- 
plicitly added to the tip segments of other evolved plant structures. 
A forest of plant structures created using these methods ts shown in 
figure 3. 


3.2 Mutating Parameter Sets 


For artificial evolution of parameter sets to occur, they must be 
reproduced with some probability of mutation. There are many 
possible methods for mutating parameter sets. The technique used 
here involves normalizing each parameter for a genetic value be- 
tween .0 and 1.0, and then copying each genetic value or gene, g;, 
from the parent to the child with a certain probability of mutation, 
m. A mutation is achieved by adding a random amount, +d, to the 
gene. So, anew genotype, G’, is created using each gene, gi, of a 
parent genotype, G, as follows: 


For each g; 
If rand(.0,1.0) < m 
then g) = gq, + rand(—d.d) 
clamp or wrap g; to legal bounds. 
else g, = 9 


The normalized values are scaled, offset, and optionally squared to 
give the parameter values actually used. This allows the mutation 
distances, +d, to be proportional to the scale of the range of valid 
parameter values. Squaring or raising some values to even higher 
powers can be useful because it causes more sensitivity in the lower 
region of the range of parameter values. The mutation rate and 
amount are easily adjusted, but are commonly useful at much higher 
values than in natural systems (m = 0.2,d = 0.4). The random 
value between —d and d might preferably be found using a Gaussian 
distribution instead of this simple linear distribution, giving smaller 
mutations more likelihood than larger ones. 


3.3 Mating Parameter Sets 


When two parameter sets are found that both create structures with 
different successful features, it is sometimes desirable to combine 
these features into a single structure. This can be accomplished by 
mating them. Reproducing two parameter sets with sexual combi- 
nation can be performed in many ways. Four possible methods are 
listed below with some of their resulting effects: 

1. Crossovers can be performed by sequentially copying genes 
from one parent, but with some frequency the source genotype is 
switched to the other parent. This causes adjacent genes to be more 
likely to stick together than genes at opposite ends of the sequence. 
Each pair of genes has a linkage probability depending on their 
distance from each other. 

2. Each gene can be independently copied from one parent or 
the other with equal probability. Ifthe parent genes each correspond 
to a point in N-dimensional genetic space, then the genes of the 
possible children using this method correspond to the 2% corners 
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Figure 3: Forest of “evolved” plants. 


of the N-dimensional rectangular solid connecting the two parent 
points. This method is the most commonly used in this work and ts 
demonstrated in figure 2. Two parent plant structures are shown in 
the upper left boxes, and the remaining forms are their children. 

3. Each gene can receive a random percentage, p, of one par- 
ent’s genes, and a 1 — p percentage of the other parent’s genes. 
If the percentage is the same for each gene, linear interpolation 
between the parent genotypes results, and the children will fall 
randomly on the line between the V-dimensional points of the par- 
ents. If evenly spaced samples along this line were generated, a 
genetic dissolve could be made that would cause a smooth transition 
between the parent phenotypes if the changing parameters had con- 
tinuous effects on the phenotypes. This is an example of utilizing 
the underlying genetic representation for specific manipulation of 
the results. Interpolation could also be performed with three parents 
to create children that fall on a triangular region of a plane in the 
N-dimensional genetic space. 

4. Finally, each new gene can receive a random value between 
the two parent values of that gene. This is like the interpolation 
scheme above, except each gene is independently interpolated be- 
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tween the parent genes. This method results in possible children 
anywhere within the N-dimensional rectangular solid connecting 
the parent points. 


Mutating and mating parameter sets allow a user to explore and 
combine samples in a given parameter space. In the next section, 
methods are presented that allow mutations to add new parameters 
and extend the space, instead of simply adjusting existing parameter 
values. 


4 SYMBOLIC EXPRESSIONS AS GENOTYPES 


A limitation of genotypes consisting of a fixed number of parameters 
and fixed expression rules as described above ts that there are solid 
boundaries on the set of possible phenotypes. There is no possibility 
for the evolution of a new developmental rule or a new parameter. 
There is no way for the genetic space to be extended beyond its 
original definition — the N-dimensional genetic space will remain 
only N-dimensional. 

To surpass this limitation, it is desirable to include procedural 
information in the genotype instead of just parameter data, and 
the procedural and data elements of the genotype should not be 
restricted to a specific structure or size. 

Symbolic lisp expressions are used as genotypes in an attempt 
to meet these needs. A set of lisp functions and a set of argument 
generators are used to create arbitrary expressions which can be 
mutated, evolved, and evaluated to generate phenotypes. Some 
mutations can create larger expressions with new parameters and 
extend the space of possible phenotypes, while others just adjust 
existing parts of the expression. Details of this process are best 
described by the examples below. 


4.1 Evolving Images 


The second example of artificial evolution involves the generation 
of textures by mutating symbolic expressions. Equations that cal- 
culate a color for each pixel coordinate (x, y) are evolved using a 
function set containing some standard common lisp functions [26], 
vector transformations, procedural noise generators, and image pro- 
cessing operations: 


+, —, *, /, mod, round, min, max, abs, expt, log, and, 
or, xor, sin, cos, atan, if, dissolve, hsv-to-rgb, vector, 
transform-vector, bw-noise, color-noise, warped-bw- 
noise, warped-color-noise, blur, band-pass, grad-mag, 
grad-dir, bump, ifs, warped-ifs, warp-abs, warp-rel, 
warp-by-grad. 


Each function takes a specified number of arguments and calculates 
and returs an image of scalar (b/w) or vector (color) values. 
Noise generators can create solid 2D scalar and vector noise 
at various frequencies with random seeds passed as arguments so 
specific patterns can be preserved between generations [figure 4f, 
and 4i}. The warped versions of functions take (U, V) coordinates 
as arguments instead of using global (X,Y) pixel coordinates, 
allowing the result to be distorted by an arbitrary inverse mapping 
function [figure 4i]. Boolean operations (and, or, and xor) operate 
on each bit of floating-point numbers and can cause fractal-like 
grid patterns [figure 4e]. Versions of sin and cos which normalize 
their results between .0 and 1.0 instead of .-1.0 and 1.0 can be 
useful. Some functions such as blurs, convolutions, and those that 
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Figure 4: Simple expression examples. 


(reading left to right, top to bottom) 


c. (abs X) 

d. (mod X (abs Y)) 

e. (and X Y) 

f. (bw-noise .2 2) 

g. (color-noise .1 2) 

h. (grad-direction (bw-noise .15 2) .0 0) 
i. (warped-color-noise (* X .2) Y 1 2) 


use gradients also use neighboring pixel values to calculate their 
result [figure 4h]. Band-pass convolutions can be performed using 
a difference of Gaussians filter which can enhance edges. Iterative 
function systems (ifs) can generate fractal patterns and shapes. 

Details of the specitic implementations of these functions are 
not given here because they are not as important as the methods used 
for combining them into longer expressions. Many other functions 
would be interesting to include in this function set, but these have 
provided for a fairly wide variety of resulting images. 

Simple random expressions are generated by choosing a func- 
tion at random from the function set above, and then generating as 
many random arguments as that function requires. Arguments to 
these functions can be either scalars or vectors, and either constant 
values or images of values. Random arguments can be generated 
from the following forms: 


— Arandom scalar value such as .4 

— A random 3-element vector such as #(.42 .23 .69) 

— A variable such as the X or Y pixel coordinates. 

— Another lisp expression which returns a b/w or 
color image. 


Most of the functions have been adapted to either coerce the 
arguments into the required types, or perform differently according 
to the argument types given to them. Arguments to certain functions 
can optionally be restricted to some subset of the available types. 
For the most part these functions receive and return images, and 
can be considered as image processing operations. Expressions are 
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simply evaluated to produce images. Figure 4 shows examples of 
some simple expressions and their resulting images. 

Artificial evolution of these expressions is performed by first 
generating and displaying a population of simple random expres- 
sions in a grid for interactive selection. The expressions of images 
selected by the user are reproduced with mutations for each new 
generation such that more and more complex expressions and more 
perceptually successful images can evolve. Some images evolved 
with this process are shown in figures 9 to 13. 


4.2 Mutating Symbolic Expressions 


Symbolic expressions must be reproduced with mutations for evo- 
lution of them to occur. There are several properties of symbolic 
expression mutation that are desirable. Expressions should often 
be only slightly modified, but sometimes significantly adjusted in 
structure and size. Large random changes in genotype usually 
result in large jumps in phenotype which are less likely to be im- 
provements, but are necessary for extending the expression to more 
complex forms. 

A recursive mutation scheme is used to mutate expressions. 
Lisp expressions are traversed as tree structures and each node is in 
turn subject to possible mutations. Each type of mutation occurs at 
different frequencies depending on the type of node: 


1. Any node can mutate into a new random expression. This 
allows for large changes, and usually resuits in a fairly significant 
alteration of the phenotype. 

2. If the node is a scalar value, it can be adjusted by the addition 
of some random amount. 

3. If the node is a vector, it can be adjusted by adding random 
amounts to each element. 

4. If the node is a function, it can mutate into a different function. 
For example (abs X) might become (cos X). If this mutation occurs, 
the arguments of the function are also adjusted if necessary to the 
correct number and types. 

5. An expression can become the argument to a new random 
function. Other arguments are generated at random if necessary. 
For example X might become (* X .3). 

6. An argument to a function can jump out and become the new 
value for that node. For example (* X .3) might become X. This is 
the inverse of the previous type of mutation. 

7. Finally, a node can become a copy of another node from the 
parent expression. For example (+ (abs X) (* Y .6)) might become 
(+ (abs (* Y .6))(* Y .6)). This causes effects similar to those caused 
by mating an expression with itself. It allows for sub-expressions 
to duplicate themselves within the overall expression. 


Other types of mutations could certainly be implemented, but 
these are sufficient for a reasonable balance of slight modifications 
and potential for changes in complexity. 

It is preferable to adjust the mutation frequencies such that a 
decrease in complexity is slightly more probable than an increase. 
This prevents the expressions from drifting towards large and slow 
forms without necessarily improving the results. They should still 
easily evolve towards larger sizes, but a larger size should be due 
to selection of improvements instead of random mutations with no 
effect. 

The relative frequencies for each type of mutation above can be 
adjusted and experimented with. The overall mutation frequency is 
scaled inversely in proportion to the length of the parent expression. 
This decreases the probability of mutation at each node when the 


parent expression is large so that some stability of the phenotypes 
is Maintained. 

The evaluation of expressions and display of the resulting im- 
ages can require significant calculation times as expressions in- 
crease in complexity. To keep image evolution at interactive speeds, 
estimates of compute speeds are calculated for each expression by 
summing pre-computed runtime averages for each function. Slow 
expressions are eliminated before ever being displayed to the user. 
New offspring with random mutations are generated and tested until 
fast enough expressions result. In this way automatic selection is 
combined with interactive selection. If necessary, this technique 
could also be performed to keep memory usage to a minimum. 


4.3. Mating Symbolic Expressions 


Symbolic expressions can be reproduced with sexual combinations 
to allow sub-expressions from separately evolved individuals to be 
mixed into a single individual. Two methods for mating symbolic 
expressions are described. 

The first method requires the two parents to be somewhat sim- 
ilar in structure. The nodes in the expression trees of both parents 
are simultaneously traversed and copied to make the new expres- 
sion. When a difference is encountered between the parents, one 
of the two versions is copied with equal probability. For example, 
the following two parents can be mated to generate four different 
expressions, two of which are equal to the parents, and two of which 
have some portions from each parent: 


parentl: (* (abs X) (mod X Y)) 
parent2: (* (/ Y X) (* X -.7)) 


child1: (* (abs X) (mod X Y)) 
child2: (* (abs X) (* X -.7)) 
child3: (* (/ Y X) (mod X Y)) 
child4: (* (/ Y X) (* X -.7)) 


This method is often useful for combining similar expressions that 
each have some desired property. It usually generates offspring 
without very large variations from the parents. Two expressions 
with different root nodes will not form any new combinations. This 
might be compared to the inability of two different species to mate 
and create viable offspring. 

The second method for mating expressions combines the parents 
in a less constrained way. A node in the expression tree of one parent 
is chosen at random and replaced by a node chosen at random from 
the other parent. This crossing over technique allows any part 
of the structure of one parent to be inserted into any part of the 
other parent and permits parts of even dissimilar expressions to be 
combined. With this method, the parent expressions above can 
generate 61 different child expressions — many more than the 4 of 
the first method. 


4.4 Evolving Volume Textures 


A third variable, Z, is added to the list of available arguments to 
enable functions to be evolved that calculate colors for each point in 
(X,Y, Z) space. The function set shown in section 4.1 is adjusted 
for better results: 2D functions that require neighboring pixel values 
such as convolutions and warps are removed, and 3D solid noise 
generating functions are added. 

These expressions are more difficult to visualize because they 
encompass all of 3D space. They are evaluated on the surfaces 
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Figure 6: Marble and wooden tori. 


of spheres and planes for fast previewing and selection as shown 
in figure 5. Evolved volume expressions can then be incorporated 
into procedural shading functions to texture arbitrary objects. This 
process allows complex volume textures such as those described 
in [18] and [19] to be evolved without requiring specific equations 
to be understood and carefully adjusted by hand. Figure 6 was 
generated by evolving three volume texture expressions and then 
evaluating them at the surfaces positions of three objects during the 
rendering process. 


4.5 Evolving Animations 


Several extensions to the image evolution system described above 
can be used to evolve moving images. Five methods for incorpo- 
rating a temporal dimension in symbolic expressions are proposed: 

1. Another input variable, Tv7ne, can be added to the list of 
available arguments. Expressions can be evolved that are functions 
of X,Y, and Tze such that different images are produced as the 
value of Time is smoothly animated. More computation is required 
to generate, display and select samples because a sequence of im- 
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ages must be calculated. An alternate method of display involves 
displaying various slices of the (X, Y, Time) space (although op- 
erations requiring neighboring pixel values might not receive the 
correct information if the values of Time vary between them). 

2. Genetic cross dissolves can be performed between two ex- 
pressions of similar structure. Interpolation between two expres- 
sions is performed by matching the expressions where they are iden- 
tical and interpolating between the results where they are different. 
Results of differing expression branches are first calculated and 
dissolved, and then used by the remaining parts of the expression. 
If the two expressions have different root nodes, a conventional 
image dissolve will result. If only parts within their structures are 
different, interesting motions can occur. This technique utilizes the 
existing genetic representation of evolved still images to generate 
in-betweens for a smooth transition from one to another. It is an 
example of the usefulness of the alternate level of control given 
by the underlying genetic information. A series of frames from a 
genetic cross dissolve are shown in figure 7. 

3. An input image can be added to the list of available arguments 
to make functions of X,Y, and /mage. The input image can then 
be animated and processed by evaluating the expression multiple 
times for values of /mage corresponding to frames of another 
source of animation such as hand drawn or traditional 3D computer 
graphics. This is effectively a technique for evolving complex 
image processing and warping functions that compute new images 
from given input images. Figure 8 was created in this way with an 
input image of a human face. 

4, The images that use the pixel coordinates (X,Y) to de- 
termine the colors at each pixel can be animated by altering the 
mappings of X and Y before the expression is evaluated. Simple 
zooming and panning can be performed as well as 3D perspective 
transformations and arbitrary patterns of distortion. 

5. Evolved expressions can be adjusted and experimented with 
by hand. If parameters in expressions are smoothly interpolated 
to new values, the corresponding image will change in potentially 
interesting ways. For example, solid noise can be made to change 
frequency, colors can be dissolved into new shades, and angles 
can be rotated. This is another example of utilizing the underlying 
genetic information to manipulate images. A small change in the 
expression can result in a powerful alteration of the resulting image. 

Finally, the techniques above can be used together in various 
combinations to make an even wider range of possibilities for evolv- 
ing animations. 
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Figure 7: Frames from a “genetic cross dissolve.” 
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Figure 8: Fire of Faces. Figure 9. 


5 RESULTS 


Evolution of 3D plant structures, images, solid textures, and an- 
imations have been implemented on the Connection Machine |/*! 
system CM-2, a data parallel supercomputer [10,27]. The parallel 
implementation details will not be discussed in this paper, but each 
application is reasonably suited for highly parallel representation 
and computation. Lisp expression mutations and combinations are 
performed on a front-end computer and the Connection Machine 
system is used to evaluate the expression for all pixels in parallel 
using Starlisp and display the resulting image. 

3D Plant structures have been evolved and used in the animated 
short Punspermia {24}. A frame from this sequence is shown in 
figure 3 which contains a variety of species created using these 
techniques. An interactive system for quickly growing, displaying, 
and selecting sample structures allows a wide range of plant shapes 
to be efficiently created by artificial evoluuon. Populations of 
samples can be displayed for selection in wire frame in a grid format 
as shown in figure 2, or displayed as separate higher-resolution Figure 10. 
images which can be interactively flipped through by scrolling with 
a mouse. Typically between 5 and 20 generations are necessary for 
acceptable structures to emerge. 





Images, volume textures, and various animations have been 
created using mutating symbolic expressions. These sometimes 
require more generations to evolve complex expressions that give 
interesting images - often at least 10 to 40 generations. Again, an 
interactive tool for quickly displaying grids of sample images to be 
selected amongst makes the evolution process reasonably efficient. 
[See figure 5.] The number of possible symbolic expressions of 
acceptable length is extremely large, and a wide variety of textures 
and patterns can occur. Completely unexpected kinds of images 
have emerged. Figure 9 was created from the following evolved 
expression: 


(round (log (+ ¥ (color-grad (round (+ (abs (round 
(log (+ ¥ (color-grad (round (+ y (log (invert y) 15.5)) 
x) 3.1 1.86 #(0.9S 0.7 0.59) 1.35)) 0.19) x)) (log (invert 
y) 15.5)) x) 3.1 1.9 #(0.95 0.7 0.35) 1.35)) 0.19) x) 





Figure 1] 1. 
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Figure 13 was created from this expression: 


(sin (+ (- (grad-direction (blur (if (hsv-to-rgb (warped- 
color-noise #(0.57 0.73 0.92) (/ 1.85 (warped-color- 
noise x y 0.02 3.08)) 0.11 2.4)) #(0.54 0.73 0.59) #(1.06 
0.82 0.06)) 3.1) 1.46 5.9) (hsv-to-rgb (warped-color- 
noise y (/ 4.5 (warped-color-noise y (/ x y) 2.4 2.4)) 
0.02 2.4))) x)) 


Note that expressions only five or six lines long can generate images 
of fair complexity. Equations such as these can be evolved from 
scratch in timescales of only several minutes - probably much faster 
than they could be designed. 

Figures 10, 11, and 12 were also created from expressions of 
similar lengths. Fortunately, analysis of expressions is not required 
when using these methods to create them. Users usually stop at- 
tempting to understand why each expression generates each image. 
However, for those interested, expressions for other figures are 
listed in the appendix. 


Two different approaches of user selection behavior are possi- 
ble. The user can have a goal in mind and select samples that are 
closer to that goal until it is hopefully reached. Alternatively, the 
user can follow the more interesting samples as they occur without 
attempting to reach any specific goal. 

The results of these various types of evolved expressions can 
be saved in the very concise form of the final genotypic expression 
itself. This facilitates keeping large libraries of evolved forms which 
can then be used to contribute to further evolutions by mating them 
with other forms or further evolving them in new directions. 


6 FUTURE WORK 


Artificial evolution has many other possible applications for com- 
puter graphics and animation. Procedures that use various other 
forms of solid noise could be explored, such as those that create ob- 
jects, create density functions, or warp objects (20, 15]. Procedures 
could be evolved that generate motion from a set of rules (possibly 
cellular automata, or particle systems), or that control distributions 
and characteristics of 2D objects such as lines, solid shapes, or brush 
strokes. Algorithms that use procedural construction rules to create 
3D objects from polygons, or functions that generate, manipulate, 
and combine geometric primitives could also be explored. 

These techniques might also make valuable tools in domains 
beyond computer simulations. New possibilities for shapes and 
textures could be explored for use in product design or the fashion 
industry. 

Several variations on the methods for artificial evolution de- 
scribed above might make interesting experiments. Mutation fre- 
quencies could be included in the genotype itself so that they also 
can be mutated. This might allow for the evolution of evolvability 
[4]. Frequencies from the most successful evolutions could be kept 
as the defaults. 

It might be interesting to attempt to automatically evolve a sym- 
bolic expression that could generate a simple specific goal image. 
An image differencing function could be used to calculate a fitness 
based on how close a test image was to the goal, and an expression 
could be searched for by automatic selection. Then, interactive 
selection could be used to evolve further images starting with that 
expression. 
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Large amounts of information of all the human selection choices 
of many evolutions could be saved and analyzed. A difficult chal- 
lenge would be to create a system that could generalize and “‘under- 
stand” what makes an image visually successful, and even generate 
other images that meet these learned criteria. 

Combinations of random variations and non-random variations 
using learned information might be helpful. If a user picks pheno- 
types in a certain direction from the parent, mutations for the next 
generation might have a tendency to continue in that same direction, 
causing evolution to have “momentum.” 

Also, combinations of evolution and the ability to apply specific 
adjustments to the genotype might allow more user control over 
evolved results. Automatic “genetic engineering” could permit a 
user to request an evolved image to be more blue, or a texture more 
grainy. 


7 CONCLUSION 


Artificial evolution has been demonstrated to be a potentially pow- 
erful tool for the creation of procedurally generated structures, tex- 
tures, and motions. Reproduction with random variations and sur- 
vival of the visually interesting can lead to useful results. Repre- 
sentations for genotypes which are not limited to fixed spaces and 
can grow in complexity have shown to be worthwhile. 

Evolution is a method for creating and exploring complexity 
that does not require human understanding of the specific process 
involved. This process of artificial evolution could be considered 
as a system for helping the user with creative explorations, or it 
might be considered as a system which attempts to “learn” about 
human aesthetics from the user. In either case, it allows the user and 
computer to interactively work together in a new way to produce 
results that neither could easily produce alone. 

An important limiting factor in the usefulness of artificial evo- 
lution is that samples need to be generated quickly enough such 
that it is advantageous for the user to choose from random samples 
instead of carefully adjusting new samples by hand. The computer 
needs to generate and display samples fast enough to keep the user 
interested while selecting amongst them. As computation becomes 
more powerful and available, artificial evolution will hopefully be- 
come advantageous in more and more domains. 
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9 APPENDIX 


Figure 5, Parent expression: 


(warped-color-noise (warped-bw-noise (dissolve x 2.53 y) z 0.09 12.0) (invert z) 0.05 
-2.06) 


Figure 6, Marble torus: 


(dissolve (cos (and 0.25 #(0.43 0.73 0.74))) (log (+ (warped-bw-noise (min z 11.1) 
(log (rotate-vector (4+ (warped-bw-noise (cos x) (dissolve (cos (and 0.25 #(0.43 0.73 
0.74))) (log (+ (warped-bw-noise (max (min z 8.26) (/ -0.5 #(0.82 0.39 0.19))) (log 
(+ (warped-bw-noise (cos x) z -0.04 0.89) #(0.82 0.39 0.19)) #(0.15 0.34 0.50)) -0.04 
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Figure 12. 
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Figure 13. 
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-3.0) y) #(0.15 0.34 0.50) y) -0.04 -3.0) x) z y) #(0.15 0.34 0.5)) -0.02 -1.79) -0.4) 
#(-0.09 0.34 0.55)) -0.7) 


Figure 7, Cross dissolve: 


(hsv-to-rgb (bump (hsv-to-rgb (ifs 2.29 0.003 (dissolve 1.77 3.67 time) 2.60.1 (dissolve 
5.2 3.2 time) -31.0 (dissolve 23.9 -7.4 time) (dissolve 1.13 9.5 time) (dissolve 4.8 0.16 
time) 20.7 4.05 (dissolve 0.48 0.46 time) (dissolve 2.94 -0.68 time) (dissolve 0.42 0.54 
time) (dissolve 0.09 0.54 time))) (atan 2.25 (dissolve 0.1 0.11 time) 0.15) (dissolve 
4.09 8.23 time) (dissolve #(0.41 0.36 0.08) #(0.68 0.22 0.31) time) #(0.36 0.31 0.91) 
(dissolve 6.2 4.3 time) (dissolve 0.16 0.40 time) (dissolve 2.08 0.23 time))) 


Figure 8, Fire of Faces: 


(+ (min 10.8 (warp-rel image image (bump image x 9.6 #(0.57 0.02 0.15) #(0.52 
0.03 0.38) 3.21 2.49 10.8))) (dissolve #(0.81 0.4 0.16) x (dissolve y #(0.88 0.99 0.66) 
image))) 


Figure 10: 


(rotate-vector (log (+ y (color-grad (round (+ (abs (round (log #(0.01 0.67 0.86) 0.19) 
x)) (hsv-to-rgb (bump (if x 10.7 y) #(0.94 0.01 0.4) 0.78 #(0.18 0.28 0.58) #(0.4 0.92 
0.58) 10.6 0.23 0.91))) x) 3.1 1.93 #(0.95 0.7 0.35) 3.03)) -0.03) x #(0.76 0.08 0.24)) 


Figure 11 is unfortunately “extinct” because it was created before 
the genome saving utility was complete. 


Figure 12: 


(cos (round (atan (log (invert y) (+ (bump (+ (round x y) y) #(0.46 0.82 0.65) 0.02 #(0.1 
0.06 0.1) #(0.99 0.06 0.41) 1.47 8.7 3.7) (color-grad (round (+ y y) (log (invert x) (+ 
(inven y) (round (+ y x) (bump (warped-ifs (round y y) y 0.08 0.06 7.4 1.65 6.1 0.54 
3.1 0.26 0.73 15.8 5.7 8.9 0.49 7.2 15.6 0.98) #(0.46 0.82 0.65) 0.02 #(0.1 0.06 0.1) 
#(0.99 0.06 0.41) 0.83 8.7 2.6))))) 3.1 6.8 #(0.95 0.7 0.59) 0.57))) #(0.17 0.08 0.75) 
0.37) (vector y 0.09 (cos (round y y))))) 
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